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FIG. 1 



TECH CENTER 1600/2900 



AlaSerCysLeuAsnCysSerAlaSerll IleProAspArgGluValLeuTyrArgGlu 
1 GGCCTCCTGCTTGAACTGCTCGGCGAGCATCATACCTGACAGGGAAGTCCTCTACCGAGA 
CCGGAGGACGAACTTGACGAGCCGCTCGTAGTATGGACTGTCCCTTCAGGAGATGGCTCT 

PheAspGluMetGluGluCysSerClnHisLeuProTyrlleGluGlnGlyMetMetLeu 
6 1 GTTCGATGAGATGGAAGAGTGCTCTCAGCACTTACCGTACATCGAGCAAGGGATGATGCT 
CAAGCTACTCTACCTTCTCACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGA 

AlaGluGlnPheLysGlnLysAlaLeuGlyLcu 
121 CGCCGAGCAGTTCAAGCAGAAGGCCCTCGGCCTCC 
GCGGCTCGTCAAGTTCGTCTTCCGGGAGCCGGAGG 



GlyCysValVallleValGlyArgValVaiLeuSerGlyLysProAlallelleProAsp 
1 CTGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTGTCCGGGAAGCCGGCAATCATACCTG 
GACCGACGCACCAGTATCACCCGTCCCAGCAGAACAGGCCCTTCGGCCGTTAGTATGGAC 

T 

ArgGluValLeuTyrArgGluPheAspGluMetGluGluCysSerGlnHisLeuProTyr 
6 1 ACAGGG AAGTCCTCTACCGAGAGTTCGATGAGATGGAAGAGTGCTCTCAGCACTTACCGT 
TGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTACCTTCTCACGAGAGTCGTGAATGGCA 
A 

IleGluGlnGlyMetMetLeuAlaGluGlnPheLysGlnLysAlaLeuGlyLeuLeuGln 
121 ACATCG AGCAAGGG ATGATGCTCGCCG AGCAGTTCAAGCAG AAGGCCCTCGGCCTCCTGC 
TGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAGTTCGTCTTCCGGGAGCCGGAGGACG 

ThrAlaSerArgGlnAlaGluVallleAlaProAlaValGlnThrAsnTrpGlnLysLeu 
181 AGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTGTCCAGACCAACTGGCAAAAAC 
TCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGACGACAGGTCTGGTTGACCGTTTTTG 

GluThrPheTrpAlaLysHisMetTrpAsnPhelleSerGlylleGlnTyrLeuAlaGly 
24 1 TCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCAGTGGGATACAATACTTGGCGG 
AGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGTCACCCTATGTTATGAACCGCC 

LeuSerThrLeuProGlyAsnProAlalleAlaSerLeuMetAlaPheThrAlaAlaVal 
3 0 1 GCTTGTCAACGCTGCCTGGTAACCGCGCCATTGCTTCATTG ATGGCTTTTACAGCTGCTG 
CGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTAACTACCGAAAATGTCGACGAC 

Thr S er P roLeuThrThrSe rG In 
361 TC ACCAGCCCACTAACCACTAGCCAAA 
AGTGGTCGGGTGATTGGTGATCGGTTT 



FIG. 3 
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AspAlaHisPhel^uSerGlnThrLysGlnSerClyGluAsnLeuProTyrLeuValAla 
1 GATGCCCACTTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCG 
CTACGGGTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGC 

TyrGlnAlaThrValCysAlaArgAlaGlnAlaProProProSerTrpAspGLriMetTrp 
6L TACCAAGCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGG 
ATGGTTCGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACC 

LysCysLeuIleArgLeuLysProThrLeuHisGlyProThrProLeuLeuTyrArgLeu 
L 2 1 AAGTGTTTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGCTATAC AGACTG 
TTCACAAACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGAC 

GlyAlaValGlnAsnGluIleThrLeuThrHisProValThrLysTyrlleMetThrCys 
1 8 1 GGCGCTGTTCAG AATG AAATCACCCTG ACGCACCCAGTCACC AAAT AC ATCATG ACATGC 
CCGCGACAAGTCTTACTTTAGTGGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACG 

MetSerAlaAspLeuGluValValThrSerThrTrpValLeuValGIyGlyValLeuAla 
2 4 1 ATGTCGGCCGACCTGGAGGTCGTCACGAGCACCTGGGTGCTCGTTGGCCGCGTCCTGGCT 
TACAGCCGGCTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGA 

AlaLeuAlaAlaTyrCysLeuSerThrGLyCysValVallleValGlyArgValValLeu 
301 GCTTTGGCCGCGTATTGCCTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTG 
CGAAACCGGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAAC 

SerGlyLysProAlallelleProAspArgGluValLeuTyrArgGluPheAspGluMet: 
361 TCCCGG AAGCCGGCAATCATACCTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGATG 
AGGCCCTTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTAC 

GluGluCysSerGlnHisLeuProTyrlleGluGInGlyMetMetLeuAlaGluGlnPhe 
4 21 GAAGAGTGCTCTCAGGACTTACCGTACATCG AGCAAGGGATGATGCTCGCCGAGCAGTTC 
CTTCTCACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAG 

LysGlnLysAlaLeuGLyLeuLeuGlnThrAlaSerArgGlnAlaGluVallleAlaPro 
4 81 AAGCAGAAGGCCCTCGGCCTCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCT 
TTCGTCTTCCGGGAGCCGGAGGACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGA 

AlaValGlnThrAsnTrpGlnLysLeuGluThrPheTrpAlaLysHisMetTrpAsnPhe 
541 GCTGTCCAGACCAACTGGCAAAAACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTTC 
CGACAGGTCTGGTTGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAAG 

HcSerGlyIleGlnTyrLeuAlaGlyLeuSerThrLeuProGlyAsnProAlaIleAla 
601 ATGAGTGGGATACAATACTTGGCGGGCTTGTCAACGCT 

TAGTCACCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGA 

Serl^imetAlaPheThrAlaAlaValThrSerProLouThrThrSerCln 
661 TCATTGATGGCTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCCAAA 
AGTAACTACCGAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGGTTT 



FIG. 7 



Overlap with 81 

PheThrAlaAlaValThxSerProLeuThrThrSerGlnThrLeuLeuPheAsnlleLeu 
1 CTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCTTCAACATAT . 
GAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGGTTTGGGAGGAGAAGTTGTATA 

GlyGlyTrpValAlaAlaGlnLeuAlaAlaProGlyAlaAlaThrAlaPheValGlyAla 
6 1 TGGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGTGGGCG 
ACCCCCCCACCCACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAACACCCGC 

GlyLeuAlaGlyAlaAlalleGlySerValGlyLeuGlyLysValLeuIleAspIleLeu 
121 CTGGCTTAGCTGGCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAGACATCC 
GACCGAATCGACCGCGGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATCTGTAGG 

AlaGlyTyrGlyAlaGlyValAlaGlyAlaLeuValAlaPheLysIleMetSerGlyGlu 
181 TTGCAGGGTATGGCGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAGATCATGAGCGGTG 
AACGTCCCATACCGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACTCGCCAC 

ValProSerThrGluAspLeuValAsnLeuLeuProAlalleLeuSerProGlyAlaLeu 
241 AGGTCCCCTCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCGGAGCCC 
TCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGCCTCGGG 

valValGlyValValCysAlaAlalleLeuArgArgHisValGlyProGlyGluGlyAla 
301 TCGTAGTCGGCGTGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCG AGGGGG 
AGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGCTCCCCC 

ValGlnTrpMetAsnArgLeuIleAlaPheAlaSerArgGlyAsnHisValSer 
361 CAGTGCAGTGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCC 
GTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGG 
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FIG. 9A 

, Serl leGl uTh r 1 1 eTh rLeuProG 1 nAspA 1 aVa 1 Se rA rgTh rG 1 n A rgA rgG 1 yA rg 
1 TCCATTGAGACAATCACGCTCCCCCAGGATGCTGTCTCCCGCACTCAACGTCGGGGCAGG 
AGGTAACTCTGTTAGTGCGAGGGGGTCCTACGACAGAGGGCGTGAGTTGCAGCCCCGTCC 

c , JhrGlyArgGMysProGlyn 

61 ACTGGCAGGGGGAAGCCAGGCATCTACAGATTTGTGGCACCGGGGGAGCGCCCCTCCGGC 
TGACCGTCCCCCTTCGGTCCGTAGATGTCTAAACACCGTGGCCCCCTCGCGGGGAGGCCG 

101 MetPheAspSerSerValLeuCysGluC^ 
121 ATGTTCGACTCGTCCGTCCTCTGTG^ 

TACAAGCTGAGCAGGCAGGAGACACTCACGATACTGCGTCCGACACGAACCATACTCGAG 

, o , TDJCProAlaG 1 uThrTD rV a 1 A rgLeuArgA laTy rMetAsnThrProG 1 yLeuProVa 1 
181 ACGCCCGCCGAGACTACAGTTAGGCTAGGAGCGTACATGAACACCCCGGGGCTTCCCGTG 
TGCGGGCGGCTCTGATGTCAATCCGATGCTCGCATGTACTTGTGGGGCCCCGAAGGGCAC 

AGGGTCCTGGTAGAACTTAAAACCCTCCCGCAGAAATGTCCGGAGTGAGTATATCTACGG 
Tft . Si§K?Uy§erGlnThrLys6] 

301 CACTTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACCAA 
GTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGCATGGTT 

AlaThrValCysAlaArgAlaGlnAlaProProProSerTrpAspGlnMetTrpLysCys 
361 GCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGTGT 
CGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCACA 

/0 , UyIi?^RUy LysProThrLeuHisG1 y proT hrProLeuLeuTryArgLeuGlyAla 
421 TTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTGGGCGCT 
AACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGACCCGCGA 

, 0 , M§ InAsnG lul 1 eTh rLeuTh rH i sProVa lThrLysTyr.Il eMe tTh rCy sMe tSe r 
481 GTTCAGAATGAAATCACCCTGACGCACCCAGTCACCAAATACATCATGACATGCATGTCG 
CAAGTCTTACTTTAGTGGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACGTACAGC 

Al a AspLeuG 1 uVa 1 Va 1 ThrSe rTh rTrpVa 1 LeuVa 1 G 1 yG 1 y Va 1 LeuA 1 aA 1 a Leu 
541 GCCGACCTGGAGGTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTTTG 
CGGCTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGACGAAAC 

cn , A 1 aA 1 ajy rCysLeuSe rTh rG 1 yCy s Va 1 Va 1 1 1 eVa 1 G 1 yA rgVa 1 Va 1 LeuSerG 1 y 
601 GCCGCGTATTGCCTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTGTCCGGG 
CGGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAACAGGCCC 

Cf - -, LysProAla 1 1 e 1 1 eProAspA rgG 1 uVa 1 LeuTy r A rgG 1 uPhe AspG 1 uMe tG 1 uG 1 u 
661 AAGCCGGCAATCATACCTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGATGGAAGAG 
TTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTACCTTCTC 

CysSerGlnHisLeuProTyrlleGluGlnGlyMetMetLeuAlaGluGlnPheLysGln 
721 TGCTCTCAGCACTTACCGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAAGCAG 
ACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAGTTCGTC 

-,0-. LysAlaLeuGlyLeuLeuGlnThrAlaSerArgGlnAlaGluVal I leAlaProAlaVal 
781 AAGGCCCTCGGCCTCCTGCAGACCGCGTCCCGTCA6GCAGAGGTTATCGCCCCTGCTGTC 
TTCCGGGAGCCGGAGGACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGACGACAG 




FIG. 9B 

841 CAGACCAACTGGCAAAAACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCAGT 
GTCTGGTTGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGM 

901 GGGATACAATACTTGGCGGGCTT^ 

CCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTAAC 

, Me tA 1 aPheTh rA 1 aA 1 a Va 1 Th rSe rProLeuTh rTh rSe rG 1 nTh rLeuLeuPheAsn 
961 ATGGCTTTTACAGCTGCTGTCACCAGC 

TACCGAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGGTTTGGGAGGACAAGTTG 

UeLeuGlyGlyTrpValAlaAlaGlnLeuAlaAlaProGlyAlaAlaThrAlaPheVal 
1021 ATATTGGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCC 

TATAACCCCCCCACCCACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAACAC 

GlyAlaGlyLeuAlaGlyAlaAlalleGlySerValGlyLeuGlyLysValLeuIleAsp 
1081 GGCGCTGGCTTAGCTGGCGCCGCC^ 

CCGCGACCGAATCGACCGCGGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATCTG 
JleLeuAlaGlyTyrGlyM 

^IPPTT^P^^TtT^^^^^^^^^^^^a^^^ttgtggcattcaagatcatgagc 

TAGGAACGTCCCATACCGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACTCG 

G 1 yG 1 uVa 1 ProSe rTh rG 1 uA spLeuVa 1 AsnLeuLeuProA 1 a 1 1 eLeuSe rProG 1 y 
rrlrT^I^o^T^^^^^^^A^^^^T^AATCTACTGCCCGCCATCCTCTCGCCCGGA 
CCACTCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGCCT 

AlaLeuValValGlyValValCysAlaAlalleLeuArgAr^ 
pCfPI^I^CGGCGTGGTra^ 

CGGGAGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGCTC 

1 79 1 r J X? J^X^l PBR M ?£ Sf P^Aiy 1 1 ? A JiaPheA 1 aSe r A rgG 1 yAsnH i s Va 1 Se r 
1321 GGGGCAGT6CAGTGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCC 
CCCCGTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCC^ 



1141 
1201 
1261 



au cd 
<pu 

3 H < 
0) H < 
i-h* CJ CD 
>iH < 
H U U 

uou 

tnO CD 
MOO 
< CD CD 
IhQCD 

>i*j: eh 

Eh H < 

u a cd 

>i*C En 
EH EH rfl 

<o a cd 
a cd 
CD cj 
CD CD 
Eh << 
. CD CD 
fO CD CD 
CD CD 
CD CD 
Eh < 



rH 



5 



M W 


r r% 


i r \ 
i—l 


r n 


C* ^— 1 f > 


f n 


<-* r \ 


r n 


CO £-1 




H3 cr* 








r \ 


f r\ 

•rn VJ^ 






r i 


rH 




*> Vrt p5 






s 


r! r j 
A- i w 


rn 




r n 






fcH «4 


NJ/ 


r 1 








r \ 




S 


<U fcH 




rH «i 


tH 






rH CJ 


CD 


CD CJ 


CD 


W <i CD 


CJ 


(0 CJ 


cd 


W CJ 


CD 


QJ C CD 


C^ 


rH CJ 


CD 


CJ 


^3 


P> rH ^1 


r 1 
fcH 




CJ 


Eh <; 


EH 


? V? H 


CD 




— ** 


rH CJ 




1 O CD 


CD 


CO <C 


Eh 


(d Eh 


<; 


1 H O 


CD 


< cd 


CJ 


> p 




1 CU CJ 


CD 




CD 


CO Eh 


< 


I 3 CJ 




x: cj 


Eh 


>iCD 




! ^fl 


< 




CJ Eh 




1 i-3 cj 




Eh 


Jh CD 


CJ 


1 Jh CD 


0 


rH CJ 


CD 


X CJ 


C3 


1 x; cj 






CJ 


Eh < 


E^ 


1 Eh < 


Eh 


rH CD 


CJ 


C! Eh 


<; 


1 <l) CJ 


CD 


Eh 






Eh 


1 rH EH 


< 


> cd 


CJ 


< < 


Eh 


! *r* ^ 


Eh 


rH CJ 


CD 


U CJ 


CD 


1 u < 


Eh 




< 


>i< 


Eh 




CD 


> cd 


CJ 


Eh Eh 


<C 


1 Eh < 


Eh 


rH CJ 


CD 




CD 


1 3 CD 




fd eh 


< 


co 


EH 


1 rH r< 


Eh 


> cd 


CJ 


< CD 


CJ 


1 CD CD 


CJ 


«H Eh 




0) <c 


Eh 


1 <D Eh 






3 


rH Eh 




1 rH Eh 


a 


> cd 




M i< 




1 m <; 


Eh 


0<Eh 




i-H CD 




1 Jh CJ 


CD 


CO <« 




fu EH 




1 CJ 


fcH 


< ^ 


CD 


P*- CD 


CD 


fcH <J 




CD 


M CD 


CD 


a\ r \ 
CD CJ 




1— 1 Cj) 


CD 




CD 


fcH 




O CD 


CD 


CO Eh 


< 


CU fcH 




<i r \ 


CD 


n.r ^ 


CD 




r n 

V—/ 


ni rn 


CD 


Cfl <cf 


EH 


rl r J 




co <: 


EH 


fa* 


CD 


PH <f! 


FH 




CD 




a 


O EH 


• ^ 


.C CD 


CD 


^ EH 


Jh O 


CD 


H < 


Eh 


Ok EH 




Or CJ 


CD 


O CD 


CD 


Q4CD 


CD 


O.CJ 


CD 


JH CD 


CD 


CO < 


EH 


CO < 


EH 


a. CD 
<D CD 


GG 


< CD 
>iCD 


GC 




CD 

% 


rH E-i 


< 


rH CD 


O 


0) EH 




M < 


Eh 


CD CD 


CD 


J CJ 


CD 


rH CD 




U CD 


CD 


JH U 


8 


OS E-i 


a 


^ CD 


TG 


0) CD 




> CD 


CD 


EH < 


cn < 


EH 


Jh CD 


CD 


M Eh 




(U CJ 




<U CD 




>i< 




£ EH 


c| 


CO Eh 




EH EH 




O. Eh 




«-H CD 


CD - 


>iCD 


CD 


CU Eh 




<0 Eh 


< 


rH CD 


CD 


CO «C 


Eh 


> CD 




CD CD 


CD 


cd 


CD 


rH 




rH 




rH 




VD 




(M 




00 






rH 




rH 





in 
n 

c 
O 



CD CD 
H Eh < 
SX CD CD 
EH < EH 
tnCD CD 
rH CD CD 

< < Eh 
>iCD CD 
rH CD CD 
CD CD CD 
CnCD CD 
U CD CD 

< U CD 
Cn Eh " 
Jh 

C 



CD CD 
U CD 



EH 



EH 

< EH 

CD CD CD 

" EH < 

CJ CD 

< Eh 
tr»CJ CD 
JH CD CJ 
< CD CD 
Jh CD CD 

H 0) CJ CD 
O CO Eh < 

rH 



FIG. 1 1 

Overlap with 32 

Me tAsnA rg Leu 1 1 eA 1 aPhe A 1 aSe rA rgG 1 yAsnH i sVa 1 Se r ProThrH i sTy rVa 1 
1 GATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCCCACGCACTACGT 
CTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGGGTGCGTGATGCA 

ProG 1 uSe r AspA 1 aA 1 aA 1 aA rgVa 1 Th r A 1 a I leLeuSe rSe r LeuTh rVa 1 Th rG 1 n 
61 GCCGGAGAGCGATGCAGCTGCCCGCGTCACTGCCATACTCAGCAGCCTCACTGTAACCCA 
CGGCCTCTCGCTACGTCGACGGGCGCAGTGACGGTATGAGTCGTCGGAGTGACATTGGGT 

LeuLeuArgArgLeuHisGlnTrpIleSerSerGluCysThrThrProCysSerGlySer 
121 GCTCCTGAGGCGACTGCACCAGTGGATAAGCTCGGAGTGTACCACTCCATGCTCCGGTTC 
CGAGGACTCCGCTGACGTGGTCACCTATTCGAGCCTCACATGGTGAGGTACGAGGCCAAG 

TrpLeuArgAspI leTrpAspTrpI leCysGluValLeuSerAspPheLysThrTrpLeu 
181 CTGGCTAAGGGACATCTGGGACTGGATATGCGAGGTGTTGAGCGACTTTAAGACCTGGCT 
GACCGATTCCCTGTAGACCCTGACCTATACGCTCCACAACTCGCTGAAATTCTGGACCGA 

LysAlaLysLeuMetProGlnLeuProGlylleProPheValSerCysGlnArgGlyTyr 
241 AAAAGCTAAGCTCATGCCACAGCTGCCTGGGATCCCCTTTGTGTCCTGCCAGCGCGGGTA 
TTTTCGATTCGAGTACGGTGTCGACGGACCCTAGGGGAAACACAGGACGGTCGCGCCCAT 

LysGlyValTrpArgVal 
301 TAAGGGGGTCTGGCGAGTG 
ATTCCCCCAGACCGCTCAC 
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FIG. 13 



CysSerLeuThrValThrGlnLeuI^euArgArgLeuHisGlnTrpIleSerSerGluCys 
1 ACTGCAGCCTCACTGTAACCCAGCTCCTGAGGCGACTGCACCAGTGGATAAGCTCGGAGT 
TGACGTCGGAGTGACATTGGGTCGAGGACTCCGCTGACGTGGTCACCTATTCGAGCCTCA 



ThrThrProCysSerGlySerTrpLeuArgAspIleTrpAspTrpIleCysGluValLeu 
61 GTACCACTCCATGCTCCGGTTCCTGGCTAAGGGACATCTGGGACTGGATATGCGAGGTGT 
CATGGTGAGGTACGAGGCCAAGGACCGATTCCCTGTAGACCGTGACCTATACGCTCCACA* 

Overlap with 33b 

SerAspPheLysThrTrpLeuLysAlaLysLeuMetProGlnLeuProGlylleProPhe 
121 TGAGCG ACTTTAAG ACCTGGCTAAAAGCTAAGCTCATGCCACAGCTGCCTGGGATCCCCT 
ACTCGCTGAAATTCTGGACCGATTTTCGATTCGAGTACGGTGTCGACGGACCCTAGGGGA 



ValSerCysGlnArgGlyTyrLysGlyValTrpArgGlyAspGlylleMetHisThrArg 
181 TTGTGTCCTGCCAGCGCGGGTATAAGGGGGTCTGGCGAGGGG ACGGCATCATGCACACTC 
AACACAGGACGGTCGCGCCCATATTCCCCCAGACCGCTCCCCTGCCGTAGTACGTGTGAG 

CysHisCysGlyAlaGlulleThrGlyHisValLysAsnGlyThrMetArglleValGly 
241 GCTGCCACTGTGGAGCTGAG ATC ACTGGAC ATGTCAAAAACGGGACG ATGAGGATCGTCG 
CGACGGTGACACCTCGACTCTAGTGACCTGTACAGTTTTTGCCCTGCTACTCCTAGCAGC 

ProArgThrCysArgAsnMetTrpSerGlyThrPheProIleAsnAlaTyrThrThrGly 
301 GTCCTAGGACCTGC AGG AACATGTGG AGTGGG ACCTTCCCCATTAATGCCTACACCACGG 
CAGGATCCTGGACGTCCTTGTACACCTCACCCTGGAAGGGGTAATTACGGATGTGGTGCC 

ProCysThrProLeuProAlaProAsnTyrThrPheAlaLeuTrpArgValSerAlaG lu 
361 GCCCCTQTACCCCCCTTCCTGCGCCGAACTACACGTTCGCGCTATGGAGGGTGTCTGCAG 
CGGGGACATGGGGGGAAGGACGCGGCTTGATGTGCAAGCGCGATACCTCCCACAGACGTC 

GluTyrValGluIleArgGlnValGlyAspPheHisTyrValThrGlyMetThrThrAsp 
4 21 AGG AATATGTGGAG ATAAGGCAGGTGGGGG ACTTCCACTACGTG ACGGGTATGACTACTG 
TCCTTATACACCTCTATTCCGTCCACCCCCTGAAGGTGATGCACTGCCCATACTGATGAC 

AsnLeuLysCysProCysGlnValProSerProGluPhePheThrGlu 
4 81 ACAATCTCAAATGCCCGTGCCAGGTCCCATCGCCCGAATTTTTCACAGAAT 
TGTTAGAGTTTACGGGCACGGTCCAGGGTAGCGGGCTTAAAAAGTGTCTTA 



FIG. 14A 

AlaTyrMetSerLysAlaHisGlyl leAspProAsnl leArgThrGlyValArgThrl le 
1 TGCTTACATGTCCAAGGCTCAT666ATC6ATCCTAACATCAGGACC6GGGTGAGAACAAT 
ACGAATGTACAGGTTCCGAGTACCCTAGCTAGGATTGTAGTCCTGGCCCCACTCTTGTTA 

ThrThrGlySerProIleThrTyrSerThrTyrGlyLysPheLeuAlaAspGlyGlyCys 
61 TACCACTGGCAGCCCCATCACGTACTCCACCTAC<5GCAAGTTCCTTGCCGACGGCGGGTG 
ATGGTGACCGTCGGGGTAGTGCATGAGGTGGATGCCGTTCAAGGAACGGCTGCCGCCCAC 

SerGlyGlyAlaTyrAspI lei lei leCysAspGluCysHisSerThrAspAlaThrSer 
121 aCGGGGGGCGCTTATGACATAATAA 

GAGCCCCCCGCGAATACTGTATTATTAAACACTGCTCACGGTGAGGTGCCTACGGTGTAG 

I leLeuGlyl leGlyThrValLeuAspGlnAlaGluThrAlaGlyAlaArgLeuValVal 
181 CATCTTGGGCATCGGCACTGTCCTTGACCAAGCAGAGACTGCGGGG6CGAGACTGGTTGT 
GTAGAACCCGTAGCCGTGACAGGAACTGGTTCGTCTCTGAC6CCCCCGCTCTGACCAACA 

n , , LeuAlaThrAlaThrProProGlySerValThrValP leGluGluVal 
241 GCTCGCCACCGCCACCCCTCCGGGCTCCGTCACTGTGCCCCATCCCAACATCGAGGAGGT 
CGAGCGGTGGCGGTGGGGAGGCCCGAGGCAGTGACACGGGGTAGGGTTGTAGCTCCTCCA 

^ AlaLeuSerThrThrGlyGluI leProPheTyrGlyLysAlal leProLeuGluVal I le 
301 TGCTCTGTCCACCACCGGAGAGATCCCTTTTTACGGCAAGGCTATCCCCCTCGAAGTAAT 
ACGAGACAGGTGGTGGCCTCTCTAGGGAAAAATGCCGTTCCGATAGGGGGAGCTTCATTA 

_ LysGlyGlyArgHisLeuIlePheCysHisSerLysLysLysCysAspGluLeuAlaAla 
361 CAAGGGGGGGAGACATCTCATCTTCTGTCATTCAAAGAAGAAGTGCGACGAACTCGCCGC 
GTTCCCCCCCTCTGTAGAGTAGAAGACAGTAAGTTTCTTCTTCACGCTGCTTGAGCGGCG 

LysLeuValAlaLeuGlylleAsnAlaValAlaTyrTyrArgGlyLeuAspValSerVal 
421 AAAGCTGGTCGCATTGGGCATCAATGCCGTGGCCTACTACCGCGGTCTTGACGTGTCCGT 
TTTCGACCAGCGTAACCCGTAGTTACGGCACCGGATGATGGCGCCAGAACTGCACAGGCA 

^IleProThrSerGlyAspValValValValAlaThrAspAlaLeuMetThrGlyTyrThr 
481 CATCCCGACCAGCGGCGATGTTGTCGTCGTGGCAACCGATGCCCTCATGACCGGCTATAC 
GTAGGGCTGGTCGCCGCTACAACAGCAGCACCGTTGGCTACGGGAGTACTGGCCGATATG 

„ f , GlyAspPheAspSerVal I leAspTyrAsnThrCysValThrGlnThrValAspPheSer 
541 CGGCGACTTCGACTCGGTGATAGACTACAATACGTGTGTCACCCAGACAGTCGATTTCAG 
GCCGCTGAAGCTGAGCCACTATCTGATGTTATGCACACAGTGGGTCTGTCAGCTAAAGTC 

rM ' LeuAspProThrPheThrl leGluThrl leThrLeuProGlnAspAlaValSerArgThr 
601 CCTTGACCCTACCTTCACCATTGAGACAATCACGCTCCCCCAGGATGCTGTCTCCCGCAC 
GGAACTGGGATGGAAGTGGTAACTCTGTTAGTGCGAGGGGGTCCTACGACAGAGGGCGTG 

J? 1 fl&S&Q 6 1 y A r 9 Th rG 1 v A r 9G I yLysProG 1 y 1 1 eTy rA rgPheVa 1 A 1 aProG 1 y 
661 TCAACGTCGGGGCAGGACTGGCAGGGGGAAGCCAGGCATCTACAGATTTGTGGCACCGGG 
AGTTGCAGCCCCCTCCTGACCGTCCCCCTTCGGTCCGTAGATGTCTAAACACCGTGGCCC 

_ GluArgProSerGlyMetPheAspSerSerValLeuCysGluCysTyrAspAlaGlyCys 
721 GGAGCGCCCCTCCGGCATGTTCGACTCGTCCGTCCTCTGTGAGTGCTATGACGCAGGCTG 
CCTCGCGGGGAGGCCGTACAAGCTGAGCAGGCAGGAGACACTCACGATACTGCGTCCGAC 

, n , A 1 aTrpTy rG 1 uLeuTh r P roA 1 aG 1 uTh rTh rVa 1 A rgLeuA rgA 1 aTy rMe t A snTh r 
781 TGCTTGGTATGAGCTCACGCCCGCC6AGACTACAGTTAGGCTACGAGCGTACATGAACAC 
ACGAACGATACTCGAGTGCGGGCGGCTCTGATGTCAATCCGATGCTCGCATGTACTTGTG 

o , , ,££25 J&?Mft2y a 1 CysG 1 nAspH i sLeuG 1 uPheTrpG 1 uG 1 y Va 1 PheTh rG 1 y Leu 
841 CCCGGGGCTTCCCGTGTGCCAGGACCATCTTGAATTTTGGGAGGGCGTCTTTACAGGCCT 
GGGCCCCGAAGGGCACACGGTCCTGGTAGAACTTAAAACCCTCCCGCAGAAATGTCCGGA 



FIG. 14B 

ThrHisI leAspA laH i sPheLeuSerG 1 nThrLysG 1 nSerG 1 yG 1 uAspLeuProTy r 
901 CACTCATATAGATGCCCACTTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCTTA 
GTGAGTATATCTACGGGTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAAT 

LeuValAlaTyrGlnAlaThrValCysAlaArgAlaGlnAlaProProProSerTrpAsp 
961 CCTGGTAGCGTACCAAGCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGA 
GGACCATCGCATGGTTCGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCCT 

GlnMetTrpLysCysLeuI leArgLeuLysProThrLeuHisGlyProThrProLeuLeu 
1021 CCAGATGTGGAAGTGTTTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGCT 
GGTCTACACCTTCACAAACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACGA 

TyrArgLeuGlyAlaValGlnAsnGluIleThrLeuThrHisProValThrLysTyrlle 
1081 ATACAGACTGGGCGCTGTTCAGAATGAAATCACCCTGACGCACCCAGTCACCAAATACAT 
TATGTCTGACCCGCGACAAGTCTTACTTTAGTGGGACTGCGTGGGTCAGTGGTTTATGTA 

Me tTh rCy sMe tSe rA 1 a A spLe uG 1 u Va 1 Va 1 Th r Se rTh rT rpVa 1 Le uVa 1 G 1 yG 1 y 
1141 CATGACATGCATGTCGGCCGACCTGGAGGTCGTCACGAGCACCTGGGTGCTCGTTGGCGG 
GTACTGTACGTACAGCCGGCTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCAACCGCC 

ValLeuAlaAlaLeuAlaAlaTyrCysLeuSerThrGlyCysValVal I leValGlyArg 
1201 CGTCCTGGCTGCTTTGGCCGCGTATTGCCTGTCAACAGGCTGCGTGGTCATAGTGGGCAG 
GCAGGACCGACGAAACCGGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGTC 

Va 1 Va 1 LeuSe rG 1 yLysProA 1 a 1 1 e 1 1 eProAspA rgG 1 uVa 1 LeuTy rA rgG 1 uPhe 
1261 GGTCGTCTTGTCCGGGAAGCCGGCAATCATACCTGACAGGGAAGTCCTCTACCGAGAGTT 
CCAGCAGAACAGGCCCTTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTCTCAA 

AspG 1 uMetG 1 uG 1 uCysSerG 1 nH i sLeuProTyr 1 1 eG 1 uG 1 nG 1 yMetMetLeuA 1 a 
1321 CGATGAGATGGAAGAGTGCTCTCAGCACTTACCGTACATCGAGCAAGGGATGATGCTCGC 
GCTACTCTACCTTCTCACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGAGCG 

GluGl nPheLysG 1 nLysA 1 aLeuG 1 yLeuLeuG 1 nTh r A 1 aSe rA rgG 1 nA 1 aG 1 uVa 1 
1381 CGAGCAGTTCAAGCAGAAGGCCCTCGGCCTCCTGCAGACCGCGTCCCGTCAGGCAGAGGT 
GCTCGTCAAGTTCGTCTTCCGGGAGCCGGAGGACGTCTGGCGCAGGGCAGTCCGTCTCCA 

1 1 eA 1 aP roA 1 aVa 1 G 1 nTh rAsnTr pG 1 nLy sLeuG 1 uTh rPheTrpA 1 aLysH i sMe t 
mm TATCGCCCCTGCTGTCCAGACCAACTGGCAAAAACTCGAGACCTTCTGGGCGAAGCATAT 
ATAGCGGGGACGACAGGTCTGGTTGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTATA 

TrpAsnPhel leSerGlyl leGlnTyrLeuAlaGlyLeuSerThrLeuProGlyAsnPro 
1501 GTGGAACTTCATCAGTGGGATACAATACTTGGCGGGCTTGTCAACGCTGCCTGGTAACCC 
CACCTTGAAGTAGTCACCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGGG 

AlalleAlaSerLeuMetAlaPheThrAlaAlaValThrSerProLeuThrThrSerGln 
1561 CGCCATTGCTTCATTGATGGCTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCCA 
GCGGTAACGAAGTAACTACCGAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGGT 

Th rLeuLeuPheAsnl leLeuG 1 yG lyTrpVa 1 A laA 1 aGl nLeuA 1 aA 1 aProG 1 yA la 
1621 AACCCTCCTCTTCAACATATTGGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGC 
TTGGGAGGAGAAGTTGTATAACCCCCCCACCCACCGACGGGTCGAGCGGCGGGGGCCACG 

AlaThrAlaPheValGlyAlaGlyLeuAlaGlyAlaAlalleGlySerValGlyLeuGly 
1681 CGCTACTGCCTTTGTGGGCGCTGGCTTAGCTGGCGCCGCCATCGGCAGTGTTGGACTGGG 
GCGATGACGGAAACACCCGCGACCGAATCGACCGCGGCGGTAGCCGTCACAACCTGACCC 




FIG. 14C 

LysVa 1 Leu I leAsp I 1 eLeuA 1 aG 1 yTy rG 1 y A 1 aG 1 y Va 1 A 1 aG lyA 1 aLeuVa 1 A 1 a 
1741 GAAGGTCCTCATAGACATCCTTGCAGGGTATGGCGCGGGCGTGGCGGGAGCTCTTGTGGC 
CTTCCAGGAGTATCTGTAGGAACGTCCCATACCGCGCCCGCACCGCCCTCGAGAACACCG 

PheLysIleMetSerGlyGluValProSerThrGluAspLeuValAsnLeuLeuProAla 
1801 ATTCAAGATCATGAGCGGTGAGGTCCCCTCCACGGAGGACCTGGTCAATCTACTGCCCGC 
TAAGTTCTAGTACTCGCCACTCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGACGGGCG 

I leLeuSerProGlyAlaLeuValValGlyValValCysAlaAlal leLeuArgArgHis 
1861 CATCCTCTCGCCCGGAGCCCTCGTAGTCGGCGTGGTCTGTGCAGCAATACTGCGCCGGCA 
GTAGGAGAGCGGGCCTCGGGAGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCGT 

Va 1 G 1 y ProG 1 yG 1 uG 1 yA 1 aVa 1 G 1 nTrpMe t AsnA rgLeuI leA 1 aPheA 1 aSe rA rg 
1921 CGTTGGCCCGGGCGAGGGGGCAGTGCAGTGGATGAACCGGCTGATAGCCTTCGCCTCCCG 
GCAACCGGGCCCGCTCCCCCGTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGGC 

GlyAsnHisValSerProThrHisTyrValProGluSerAspAlaAlaAlaArgValThr 
1981 GGGGAACCATGTTTCCCCCACGCACTACGTGCCGGAGAGCGATGCAGCTGCCCGCGTCAC 
CCCCTTGGTACAAAGGGGGTGCGTGATGCACGGCCTCTCGCTACGTCGACGGGCGCAGTG 

Alal leLeuSerSerLeuThrValThrGlnLeuLeuArgArgLeuHisGlnTrpI leSer 
2041 TGCCATACTCAGCAGCCTCACTGTAACCCAGCTCCTGAGGCGACTGCACCAGTGGATAAG 
ACGGTATGAGTCGTCGGAGTGACATTGGGTCGAGGACTCCGCTGACGTGGTCACCTATTC 

SerGluCysThrThrProCysSerGlySerTrpLeuArgAspI leTrpAspTrpI leCys 
2101 CTCGGAGTGTACCACTCCATGCTCCGGTTCCTGGCTAAGGGACATCTGGGACTGGATATG * 
GAGCCTCACATGGTGAGGTACGAGGCCAAGGACCGATTCCCTGTAGACCCTGACCTATAC 

GluValLeuSerAspPheLysThrTrpLeuLysAlaLysLeuMetProGlnLeuProGly 
2161 CGAGGTGTTGAGCGACTTTAAGACCTGGCTAAAAGCTAAGCTCATGCCACAGCTGCCTGG 
GCTCCACAACTCGCTGAAATTCTGGACCGATTTTCGATTCGAGTACGGTGTCGACGGACC 

I leProPheValSerCysGlnArgGlyTyrLysGlyValTrpArgValAspGlyl leMet 
2221 GATCCCCTTTGTGTCCTGCCAGCGCGGGTATAAGGGGGTCTGGCGAGTGGACGGCATCAT 
CTAGGGGAAACACAGGACGGTCGCGCCCATATTCCCCCAGACCGCTCACCTGCCGTAGTA 

H i sTh r A rgCy sH i s Gy sG 1 y A 1 aG 1 u 1 1 eTh r G 1 yH i s Va 1 Lys A snG 1 yTh rMe t A rg 
2281 GCACACTCGCTGCCACTGTGGAGCTGAGATCACTGGACATGTCAAAAACGGGACGATGAG 
CGTGTGAGeGACGGTGACACCTCGACTCTAGTGACCTGTACAGTTTTTGCCCTGCTACTC 

I leValGlyProArgThrCysArgAsnMetTrpSerGlyThrPheProI leAsnAlaTyr 
2341 GATCGTCGGTCCTAGGACCTGCAGGAACATGTGGAGTGGGACCTTCCCCATTAATGCCTA 
CTAGCAGCCAGGATCCTGGACGTCCTTGTACACCTCACCCTGGAAGGGGTAATTACGGAT 

ThrThrGlyProCysThrProLeuProAlaProAsnTyrThrPheAlaLeuTrpArgVal 
2401 CACCACGGGCCCCTGTACCCCCCTTCCTGCGCCGAACTACACGTTCGCGCTATGGAGGGT 
GTGGTGCCCGGGGACATGGGGGGAAGGACGCGGCTTGATGTGCAAGCGCGATACCTCCCA 

Se r A 1 aG 1 uG 1 uTy rVa 1 G 1 u 1 1 eA rgG 1 nVa 1 G 1 yAspPheH i sTy rVa 1 Th rG 1 yMe t 
2461 GTCTGCAGAGGAATATGTGGAGATAAGGCAGGTGGGGGACTTCCACTACGTGACGGGTAT 
CAGACGTCTCCTTATACACCTCTATTCCGTCCACCCCCTGAAGGTGATGCACTGCCCATA 

ThrThrAspAsnLeuLysCysProCysGlnValProSerProGluPhePheThrGlu 
2521 GACTACTGACAATCTCAAATGCCCGTGCCAGGTCCCATCGCCCGAATTTTTCACAGAAT 
CTGATGACTGTTAGAGTTTACGGGCACGGTCCAGGGTAGCGGGCTTAAAAAGTGTCTTA 



FIG. 15 

A 1 aVa 1 A spPhe 1 1 eProVa 1 G 1 uAsnLeuG 1 uTh rTh rMe t A rgSe rP roVa 1 PheTh r 
1 GGCGGTGGACTTTATCCCTGTGGAGAACCTAGAGACAACCATGAGGTCCCCGGTGTTCAC 
CCGCCACCTGAAATAGGGACACCTCTTGGATCTCTGTTGGTACTCCAGGGGCCACAAGTG 

AspAsnSerSerProProVa 1 Va 1 ProG InSerPheG InValAlaHl sLeuH i sA laPro 
61 GGATAACTCCTCTCCACCAGTAGTGCCCCAGAGCTTCCAGGTGGCTCACCTCCATGCTCC 
CCTATTGAGGAGAGGTGGTCATCACGGGGTCTCGAAGGTCCACCGAGTGGAGGTACGAGG 

ThrGlySerGlyLysSerThrLysValProAlaAlaTyrAlaAlaGlnGlyTyrLysVal 
121 CACAGGCAGCGGCAAAAGCACCAAGGTCCCGGCTGCATATGCAGCTCAGGGCTATAAGGT 
GTGTCCGTCGCCGTTTTCGTGGTTCCAGGGCCGACGTATACGTCGAGTCCCGATATTCCA 

LeuVa 1 LeuAsnProSe rVa 1 A 1 aA 1 aTh rLeuG 1 yPheG 1 yA 1 aTy rMe tSe rLy sA la 
181 GCTAGTACTCAACCCCTCTGTTGCTGCAACACTGGGCTTTGGTGCTTACATGTCCAAGGC 
CGATCATGAGTTGGGGAGACAACGACGTTGTGACCCGAAACCACGAATGTACAGGTTCCG 

Overlap with 40b 

HisGlyl leAspProAsnl leArgThrGlyValArgThrl leThrThrGlySerProI le 
241 TCATGGGATCGATCCTAACATCAGGACCGGGGTGAGAACAATTACCACTGGCAGCCCCAT 
AGTACCCTAGCTAGGATTGTAGTCCTGGCCCCACTCTTGTTAATGGTGACCGTCGGGGTA 

ThrTyrSerThrTyrGlyLysPheLeuAlaAspGlyGlyCysSerGlyGlyAlaTyrAsp 
301 CACGTACTCCACCTACGGCAAGTTCCTTGCCGACGGCGGGTGCTCGGGGGGCGCTTATGA 
GTGCATGAGGTGGATGCCGTTCAAGGAACGGCTGCCGCCCACGAGCCCCCCGCGAATACT 

IlellelleCysAspGluCysHisSerThrAspAlaThrSerlleLeuGlylleGlyThr 
361 CATAATAATTTGTGACGAGTGCCACTCCACGGATGCCACATCCATCTTGGGCATTGGCAC 
GTATTATTAAACACTGCTCACGGTGAGGTGCCTACGGTGTAGGTAGAACCCGTAACCGTG 

Va 1 LeuAspG 1 nA laG luTh r A 1 aG 1 yA 1 aA rgLeuVa 1 Va 1 LeuAlaTh rA 1 aTh rPro 
421 TGTCCTTGACCAAGCAGAGACTGCGGGGGCGAGACTGGTTGTGCTCGCCACCGCCACCCC 
ACAGGAACTGGTTCGTCTCTGACGCCCCCGCTCTGACCAACACGAGCGGTGGCGGTGGGG 

ProGlySerValThrValProHisProAsnlleGluGluValAlaLeuSerThrThrGly 
481 TCCGGGCTCCGTCACTGTGCCCCATCCCAACATCGAGGAGGTTGCTCTGTCCACCACCGG 
AGGCCCGAGGCAGTGACACGGGGTAGGGTTGTAGCTCCTCCAACGAGACAGGTGGTGGCC 

GluIleProPheTyrGlyLysAlalleProLeuGluVallleLysGlyGlyArgHisLeu 
541 AGAGATCCCTTTTTACGGCAAGGCTATCCCCCTCGAAGTAATCAAGGGGGGGAGACATCT 
TCTCTAGGGAAAAATGCCGTTCCGATAGGGGGAGCTTCATTAGTTCCCCCCCTCTGTAGA 

I lePheCysH i sSerLysLysLysCysAspG 1 uLeuA 1 aA laLysLeuVa 1 A laLeuG I y 
601 CATCTTCTGTCATTCAAAGAAGAAGTGCGACGAACTCGCCGCAAAGCTGGTCGCATTGGG 
GTAGAAGACAGTAAGTTTCTTCTTCACGCTGCTTGAGCGGCGTTTCGACCAGCGTAACCC 

1 1 eAsnA 1 a Va 1 A 1 aTy rTy rA rgG 1 yLeuAspVa 1 Se rVa 1 1 1 eProTh rSerG 1 yAsp 
661 CATCAATGCCGTGGCCTACTACCGCGGTCTTGACGTGTCCGTCATCCCGACCAGCGGCGA 
GTAGTTACGGCACCGGATGATGGCGCCAGAACTGCACAGGCAGTAGGGCTGGTCGCCGCT 

ValValValValAlaThrAspAlaLeuMetThrGlyTyrThrGlyAspPheAspSerVal 
721 TGTTGTCGTCGTGGCAACCGATGCCCTCATGACCGGCTATACCGGCGACTTCGACTCGGT 
ACAACAGCAGCACCGTTGGCTACGGGAGTACTGGCCGATATGGCCGCTGAAGCTGAGCCA 

IleAspCysAsnThrCys 
781 GATAGACTGCAATACGTGTG 
CTATCTGACGTTATGCACAC 




# 



FIG. 16 

ProCysThrCysGlySerSerAspLeuTyrLeuValThrArgHisAlaAspVal I lePro 
1 CTCCCTGCACTT6CGGCTCCTC66ACCTTTACCTGGTCACGAG6CACGCCGATGTCATTC 
GAGGGACGTGAACGCCGAGGAGCCTGGAAATGGACCAGTGCTCCGTGCGGCTACAGTAAG 

ValArgArgArgGlyAspSerArgGlySerLeuLeuSerProArgProMeSerTyrLeu 
61 CCGTGCGCCGGCGGGGTGATAGCAGGGGCAGCCTGCTGTCGCCCCGGCCCATTTCCTACT 
GGCACGCGGCCGCCCCACTATCGTCCCCGTCGGACGACAGCGGGGCCGGGTAAAGGATGA 

ACTTTCCGAGGAGCCCCCCAGGCGACAACACGGGGCGCCCCGTGCGGCACCCGTATAAAT 

Overlap with 

, o, ^^AlaAlaValCysTIi r A rgG 1 y Va 1 A laLysA 1 aVa 1 AspPhe 1 1 eProVa 1 G 1 uAsnLeu 
181 GGGCCGCGGTGTGCACCCGTGGAGTGGCTAAGGCGGTGGACTTTATCCCTGTGGAGAACC 
CCCGGCGCCACACGTGGGCACCTCACCGATTCCGCCACCTGAAATAGGGACACCTCTTGG 

33c -. — 

GluThrThrMetArgSerProValPheThrAspAsnSer 
241 TAGAGACAACCATGAGGTCCCCGGTGTTCACGGATAACTCCTC 
ATCTCTGTTGGTACTCCAGGGGCCACAAGTGCCTATTGAGGAG 

FIG. 17 

, ^GlyTrpArgLeuLeuAlaProIleThrAlaTyrAlaGlnGlnThrArgGlyLeuLeuGly 
1 GGGGTGGAGGTTGCTGGCGCCCATCACGGCGTACGCCCAGCAGACAAGGGGCCTCCTAGG 
CCCCACCTCCAACGACCGCGGGTAGTGCCGCATGCGGGTCGTCTGTTCCCCGGAGGATCC 

CysnelleThrSerLeuThrGlyArgAspLysAsnGlnValGluGlyGluValGlnlle 
61 GTGCATAATCACCAGCCTAACTGGCCGGGACAAAAACCAAGTGGAGGGTGAGGTCCAGAT 
CACGTATTAGTGGTCGGATTGACCGGCCCTGTTTTTGGTTCACCTCCCACTCCAGGTCTA 

, n , Va 1 SerTh rA 1 aA 1 aG 1 nThrPheLeuA 1 aTh rCy s 1 1 eAsnG 1 y Va 1 Cy sTrpTh rVa 1 
121 TGTGTCAACTGCTGCCCAAACCTTCCTGGCAACGTGCATCAATGGGGTGTGCTGGACTGT 
ACACAGTTGACGACGGGTTTGGAAGGACCGTTGCACGTAGTTACCCCACACGACCTGACA 

, « , rJy$ I sG J yA 1 aG 1 yTh rA r 9 Th rl leAlaSe rProLysG 1 yProVa 1 1 leGlnMe tTy r 
181 CTACCACGGGGCCGGAACGAGGACCATCGCGTCACCCAAGGGTCCTGTCATCCAGATGTA 
GATGGTGCCCCGGCCTTGCTCCTGGTAGCGCAGTGGGTTCCCAGGACAGTAGGTCTACAT 

„ , ThrAsnValAspGlnAspLeuValGlyTrpProAlaProGlnGlySerArgSe 
241 TACCAATGTAGACCAAGACCTTGTGGGCTGGCCCGCTCCGCAAGGTAGCCGCTCATTGAC 
ATGGTTACATCTGGTTCTGGAACACCCGACCGGGCGAGGCGTTCCATCGGCGAGTAACTG 

Overlap with 8h 

™, . ProCysThrCysGl ySerSer AspLeuTy rLeuVa 1 Thr A rgH i s 
301 ACCCTGCACTTGCGGCTCCTCGGACCTTTACCTGGTCACGAGGCACG 

TGGGACGTGAACGCCGAGGAGCCTGGAAATGGACCAGTGCTCCGTGC 






FIG. 18 



AsnMetTrpSerGlyThrPheProIleAsnAlaTyrThrThrGlyProCysThrProLeu 
1 GAACATGTGGAGTGGGACCTTCCCCATTAATGCCTACACCACGGGCCCCTGTACCCCCCT 
CTTGTACACCTCACCCTGGAAGGGGTAATTACGGATGTGGTGCCCGGGGACATGGGGGGA 



P roA 1 aPro AsnTy rTh rPhe A 1 aLeuTrpA rgVa 1 Se r A 1 aG 1 uG 1 uTy rVa 1 G 1 u 1 1 e 
61 TCCTGCGCCGAACTACACGTTCGCGCTATGGAG6GTGTCTGCAGAGGAATACGTGGAGAT 
AGGACGCGGCTTGATGTGCAAGCGCGATACCTCCCACAGACGTCTCCTTATGCACCTCTA 

ArgGlnValGlyAspPheHisTyrValTtirGlyMetThrThrAspAsnLeuLysCysPro 
121 AAGGCAGGTGGGGGACTTCCACTACGTGACGGGTATGACTACTGACAATCTTAAATGCCC 
TTCCGTCCACCCCCTGAAGGTGATGCACTGCCCATACTGATGACTGTTAGAATTTACGGG 



AlaProProCysLysProLeuLeuArgGluGluValSerPheArgValGlyLeuHisGlu 
241 TGCGCCCCCCTGCAAGCCCTTGCTGCGGGAGGAGGTATCATTCAGAGTAGGACTCCACGA 
ACGCGGGGGGACGTTCGGGAACGACGCCCTCCTCCATAGTAAGTCTCATCCTGAGGTGCT 

Ty rProVa 1 G 1 ySerG 1 nLeuProCysG 1 uProG 1 uProAspVa lAlaVal LeuTh rSer 
301 ATACCCGGTAGGGTCGCAATTACCTTGCGAGCCCGAACCGGACGTGGCCGTGTTGACGTC 
TATGGGCCATCCCAGCGTTAATGGAACGCTCGGGCTTGGCCTGCACCGGCACAACTGCAG 



GTACGAGTGACTAGGGAGGGTATATTGTCGTCTCCGCCGGCCCGCTTCCAACCGCTCCCC 

SerProProSerValAlaSerSerSerAlaSerGlnLeuSerAlaProSerLeuLysAla 
421 ATCACCCCCCTCTGTGGCCAGCTCCTCGGCTAGCCAGCTATCCGCTCCATCTCTCAAGGC 
TAGTGGGGGGAGACACCGGTCGAGGAGCCGATCGGTCGATAGGCGAGGTAGAGAGTTCCG 

ThrCysThrAlaAsnHisAspSerProAsp 
481 AACTTGCACCGCTAACCATGACTCCCCTGAT 
TTGAACGTGGCGATTGGTACTGAGGGGACTA 



Overlap with 25c 



181 




361 




FIG. 19 

Overlap with 14c 

Se rSerSer A 1 aSe rG 1 nLeuSer A 1 aProSe rLeuLy sA 1 aTh rCysTh r A 1 a AspH i s 
1 AGCTCCTCGGCTAGCCAGCTATCCGCTCCATCTCTCAAGGCAACTTGCACCGCTAACCAT 
TCGAGGAGCCGATCGGTCGATAGGCGAGGTAGAGAGTTCCGTTGAACGTGGCGATTGGTA 



AspSerProAspA 1 aG 1 uLeuI leG 1 uA 1 aAsnLeuLeuTrpA rgG 1 nG 1 uMetG 1 yG 1 u 
61 GACTCCCCTGATGCTGAGCTCATAGAGGCCAACCTCCTATGGAGGCAGGAGATGGGCGGC 
CTGAGGGGACTACGACTCGAGTATCTCCGGTTGGAGGATACCTCCGTCCTCTACCCGCCG 

AsnlleThrArgValGluSerGluAsnLysValVall leLeuAspSerPheAspProLeu 
121 AACATCACCAGGGTTGAGTCAGAAAACAAAGTGGTGATTCTGGACTCCTTCGATCCGCTT 
TTGTAGTGGTCCCAACTCAGTCTTTTGTTTCACCACTAAGACCTGAGGAAGCTAGGCGAA 

ValAlaGluGluAspGluArgGluI leSerValProAlaGluI leLeuArgLysSerArg 
181 GTGGCGGAGGAGGACGAGCGGGAGATCTCCGTACCCGCAGAAATCCTGCGGAAGTCTCGG 
CACCGCCTCCTCCTGCTCGCCCTCTAGAGGCATGGGCGTCTTTAGGACGCCTTCAGAGCC 

A rgPheA 1 aG 1 nA 1 aLeuProVa lTr pA 1 aA rgProAspTy rAsnProProLeuVa 1 G 1 u 
211 AGATTCGCCCAGGCCCTGCCCGTTTGGGCGCGGCCGGACTATAACCCCCCGCTAGTGGAG 
TCTAAGCGGGTCCGGGACGGGCAAACCCGCGCCGGCCTGATATTGGGGGGCGATCACCTC 

ThrTrpLysLysProAspTyrGluProProValValHisGlyCysProLeuProProPro 
301 ACGTGGAAAAAGCCCGACTACGAACCACCTGTGGTCCATGGCTGTCCGCTTCCACCTCCA 
TGCACCTTTTTCGGGCTGATGCTTGGTGGACACCAGGTACCGACAGGCGAAGGTGGAGGT 

LysSerProProValPro 
361 AAGTCCCCTCCTGTGCCG 
TTCAGGGGAGGACACGGC 



FIG. 20 

ValTrpAlaArgProAspTyrAsnProProLeuValGluThrTrpLysLysProAspTvr 
1 CGTTTGGGCGCGGCCGGACTATAACCCCCCGCTAGTGGAGACGTGGAAAAAACCCGACTA 
GCAAACCCGCGCCGGCCTGATATTGGGGGGCGATCACCTCTGCACCTTTTTTGGGCTGAT 

Overlap with 8f -— „ 

GluProProValValHisGlyCysProLeuProProProLysSerProProValProPro 
61 CGAACCACCTGTGGTCCATGGCTGCCCGCTTCCACCTCCAAAGTCCCCTCCTGTGCCTCC 
GCTTGGTGGACACCAGGTACCGACGGGCGAAGGTGGAGGTTTCAGGGGAGGACACGGAGG 

ProArgLysLysArgThrValValLeuThrGluSerThrLeuSerThrAlaLeuAlaGlu 
121 GCCTCGGAAGAAGCGGACGGTGGTCCTCACTGAATCAACCCTATCTACTGCCTTGGCCGA 
CGGAGCCTTCTTCGCCTGCCACCAGGAGTGACTTAGTTGGGATAGATGACGGAACCGGCT 

LeuAlaThrArgSerPheGlySerSerSerThrSerGlylleThrGlyAspAsnThrThr 
181 GCTCGCCACCAGAAGCTTTGGCAGCTCCTCAACTTCCGGCATTACGGGCGACAATACGAC 
CGAGCGGTGGTCTTCGAAACCGTCGAGGAGTTGAAGGCCGTAATGCCCGCTGTTATGCTG 

ThrSerSerG 1 uP roA laProSerG lyCysProProAspSerAspA laG 1 uSerPhe 
2m AACATCCTCTGAGCCCGCCCCTTCTGGCTGCCCCCCCGACTCCGACGCTGAGTCCTTTGC 
TTGTAGGAGACTC6GGCGGGGAAGACCGACGGGGGGGCTGAGGCTGCGACTCAGGAAACG 



FIG. 21 



A 1 aSe r A rgSerPheG 1 ySerSerSerTh rSe rG 1 y 1 1 eTh rG 1 yAspAsnTh rTh rTh r 
1 GCCTCCAGAAGCTTTGGCAGCTCCTCAACTTCCGGCATTACGGGCGACAATACGACAACA 
CGGAGGTCTTCGAAACCGTCGAGGAGTTGAAGGCCGTAATGCCCGCTGTTATGCTGTTGT 

Overlap with 33f 

SerSerGluProAlaProSerGlyCysProProAspSerAspAlaGluSerTyrSerSer 
61 TCCTCTGAGCCCGCCCCTTCTGGCTGCCCCCCCGACTCCGACGCTGAGTCCTATTCCTCC 
AGGAGACTCGGGCGGGGAAGACCGACGGGGGGGCTGAGGCTGCGACTCAGGATAAGGAGG 

MetProProLeuGluGlyGluProGlyAspProAspLeuSerAspGlySerTrpSerThr 
121 ATGCCCCCCCTGGAGGGGGAGCCTGGGGATCCGGATCTTAGCGACGGGTCATGGTCAACG 
TACGGGGGGGACCTCCCCCTCGGACCCCTAGGCCTAGAATCGCTGCCCAGTACCAGTTGC 

ValSerSerGluAlaAsnAlaGluAspValValCysCysSerMetSerTyrSerTrpThr 
181 GTCAGTAGTGAGGCCAACGCGGAGGATGTCGTGTGCTGCTCAATGTCTTACTCTTGGACA 
CAGTCATCACTCCGGTTGCGCCTCCTACAGCACACGACGAGTTACAGAATGAGAACCTGT 

GlyAlaLeuValThrProCysAlaAlaGluGluGlnLysLeuProIleAsnAlaLeuSer 
241 GGCGCACTCGTCACCCCGTGCGCCGCGGAAGAACAGAAACTGCCCATCAATGCACTAAGC 
CCGCGTGAGCAGTGGGGCACGCGGCGCCTTCTTGTCTTTGACGGGTAGTTACGTGATTCG 

AsnSerLeuLeuArgHisHisAsnLeuValTyrSerThrThrSerArgSer 
301 AACTCGTTGCTACGTCACCACAATTTGGTGTATTCCACCACCTCACGCAGTG 
TTGAGCAACGATGCAGTGGTGTTAAACCACATAAGGTGGTGGAGTGCGTCAC 



FIG. 22 

GlyThrTyrValTyrAsnHisLeuThrProLeuArgAspTrpAlaHisAsnGlyLeuArg 
1 GGCACCTATGTTTATAACCATCTGACTCCTCTTCGGGACTGGGCGCACAACGGCTTGCGA 
CCGTGGATACAAATATTGGTAGAGTGAGGAGAAGCCCTGACCCGCGTGTTGCCGAACGCT 

AspLeuAlaValAlaValGluProValValPheSerGlnMetGluThrLysLeuIleThr 
61 GATCTGGCCGTGGCTGTAGAGCCAGTCGTCTTCTCCCAAATGGAGACCAAGCTCATCACG 
CTAGACCGGCACCGACATCTCGGTCAGCAGAAGAGGGTTTACCTCTGGTTCGAGTAGTGC 

TrpGlyAlaAspThrAlaAlaCysGlyAspIlelleAsnGlyLeuProValSerAlaArg 
121 TGGGGGGCAGATACCGCCGCGTGCGGTGACATCATCAACGGCTTGCCTGTTTCCGCCCGC 
ACCCCCCGTCTATGGCGGCGCACGCCACTGTAGTAGTTGCCGAACGGACAAAGGCGGGCG 



ArgG lyArgG 1 ul leLeuLeuG lyProA laAspG 1 yMetVa 1 SerLysG lyTrpArgLeu 
181 AGGGGCCGGGAGATACTGCTCGGGCCAGCCGATGGAATGGTCTCCAAGGGTTGGAGGTTG 
TCCCCGGCCCTCTATGACGAGCCCGGTCGGCTACCTTACCAGAGGTTCCCAACCTCCAAC 



LeuAlaProI leThrAlaTyrAlaGlnGlnThrArgGlyLeuLeuGlyCysI lelLeThr 
241 CTGGCGCCCATCACGGCGTACGCCCAGCAGACAAGGGGCCTCCTAGGGTGCATAATCACC 
GACCGCGGGTAGTGCCGCATGCGGGTCGTCTGTTCCCCGGAGGATCCCACGTATTAGTGG 

Overlap with 7e 

SerLeuThrGlyArgAspLysAsnGlnValGluGlyGluValGlnlleValSerThrAla 
301 AGCCTAACTGGCCGGGACAAAAACCAAGTGGAGGGTGAGGTCCAGATTGTGTCAACTGCT 
TCGGATTGACCGGCCCTGTTTTTGGTTCACCTCCCACTCCAGGTCTAACACAGTTGACGA 



AlaGl nTh rPheLeuA 1 aTh rCy s I leAsnG 1 y Va 1 CysTrp 
361 GCCCAAACCTTCCTGGCAACGTGCATCAATGGGGTGTGCTGG 
CGGGTTTGGAAGGACCGTTGCACGTAGTTACCCCACACGACC 



FIG. 23 

GlyGlyValValLeuValGlyLeuMetAlaLeuThrLeuSerProTyrTyrLysArgTyr 
1 GGCGGTGTTGTTCTCGTCGGGTTGATGGCGCTGACTCTGTCACCATATTACAAGCGCTAT 
CCGCCACAACAAGAGCAGCCCAACTACCGCGACTGAGACAGTGGTATAATGTTCGCGATA 

HeSerTrpCysLeuTrpTrpLeuGlnTyrPheLeuThrArgValGluAlaGlnLeuHis 
6 1 ATCAGCTGGTGCTTGTGGTGGCTTCAGTATTTTCTGACCAGAGTGGAAGCGCAACTGCAC 
TAGTCGACCACGAACACCACCGAAGTCATAAAAGACTGGTCTCACCTTCGCGTTGACGTG 

ValTrpIleProProLeuAsnValArgGlyGlyArgAspAlaVallleLeuLeuMetCys 
121 GTGTGG ATTCCCCCCCTCAACGTCCGAGGGGGGCGCGACGCCGTCATCTTACTC ATGTGT 
CACACCTAAGGGGGGGAGTTGCAGGCTCCCCCCGCGCTGCGGCAGTAGAATGAGTACACA 

AlaValHisProThrLeuValPheAspIleThrLysLeuLeuLeuAlaValPheGlyPro 
181 CCTGTACACCCGACTCTGGTATTTGACATCACCAAATTGCTGCTGGCCGTCTTCGGACCC 
CGACATGTGGGCTGAGACCATAAACTGTAGTGGTTTAACGACGACCGGCAGAAGCCTGGG 

LeuTrpIleLeuGlaAlaSerLeuLeuLysValProTyrPheValArgValGlnGlyLeu 
241 CTTTGGATTCTTCAAGCCAGTTTGCTTAAAGTACCCTACTTTGTGCGCGTCCAAGGCCTT 
GAAACCTAAGAAGTTCGGTCAAACGAATTTCATGGGATGAAACACGCGCAGGTTCCGGAA 

.euArgPheCysAlaLeuAlaArgLysMetlleGlyGlyHisTyrValGlnMetVallle 
301 CTCCGGTTCTGCGCGTTAGCGCGGAAGATGATCGGAGGCCATTACGTGCAAATGGTCATC 
GAGGCCAAGACGCGCAATCGCGCCTTCTACTAGCCTCCGGTAATGCACGTTTACCAGTAG 



IlcLysLeuGlyAlaLeuThrGlyThrTyrValTyrAsnHisLeuThrProLeuArgAsp 
361 ATTAAGTTAGGGGCGCTTAGTGGCACCTATGTTTATAACCATCTCACTCCTCTTCGGG AC 
TAATTCAATCCCCGCGAATGACCGTGGATACAAATATTGGTAGAGTGAGGAGAAGCCCTG 

Overlap with It 

TrpAlaHisAsnGlyLcuArgAspLeuAlaValAlaValGluProValValPheSerGln 
4 21 TGGGCGCACAACGGCTTGCGAG ATCTGGCCGTGGCTGTAGAGCCAGTCGTCTTCTCCCAA 
ACCCGCGTGTTGCCGAACGCTCTAGACCGGCACCGACATCTCGGTCAGCAGAAGAGGGTT 



MetGluThrLysLeuIleThrTrpGly 
4 81 ATGGAG ACCAAGCTCATCACGTGGGGGGC 
TACCTf TGGTTCGAGTAGTGCACCCCCCG 



\»«^''FIG. 24 



6 1 uTy rVa 1 Va 1 LeuLeuPheLeuLeuLeuA laAspA 1 aA rgVa 1 Cy sSerCysLeuTrp 
1 666A6TACGTCGTTCTCCTGTTCCTTCT6CTTGCAGACGCGCGCGTCT6CTCCT6CTT6T 
CCCTCATGCAGCAAGAGGACAAGGAAGACGAACGTCTGCGCGCGCAGACGAGGACGAACA 

, •, 0^13^15 UVkiy U eSerG 1 n A 1 aG 1 uA 1 a A 1 aLeuG 1 uAsnLeuVa 1 1 1 eLeuAsnA 1 a 
61 GGATGATGCTACTCATATCCCAAGCGGAGGCGGCTTTGGAGA 

CCTACTACGATGAGTATAGGGTTCGCCTCCGCCGAAACCTCTTGGAGCATTATGAATTAC 

, ™ A laSerLeuA 1 aG 1 yTh rH i sG 1 yLeuVa 1 SerPheLeuVa 1 PhePheCy sPheA 1 aTrp 
121 CAGCATCCCTGGCCGGGACGCACGGTCTTGTATCCTTCCTCGTGTTCTTCTGCTTTGCAT 
GTCGTAGGGACCGGCCCTGCGTGCCAGAACATAGGAAGGAGCACAAGAAGACGAAACGTA 

, o , ^^JytLeuLysGlyLysTrpVa 1 ProG 1 yA 1 aVa 1 Ty rTh rPheTy rG 1 yMe tTrpProLeu 
181 GGTATTTGAAGGGTAAGTGGGTGCCCGGAGCGGTCTACACCTTCTACGGGATGTGGCCTC 
CCATAAACTTCCCATTCACCCACGGGCCTCGCCAGATGTGGAAGATGCCCTACACCGGAG 

LeuLeuLeuLeuLeuA 1 aLeuProG 1 nA rgA 1 aTy rA 1 aLeuAspTh rG 1 uVa 1 A 1 aA la 
241 TCCTCCTGqCCTGTTGGCGTTGCCCCAGCGGGCGTACGCGCTGGACACGGAGGTG 

AGGAGGACGAGGACAACCGCAACGGGGTCGCCC6CATGCGCGACCTGTGCCTCCACCGGC 

Overlap with lib 

™ , SerCysGl yGl yya 1 Va 1 LeuVa 1 G 1 yLeuMet A 1 aLeuTh rLeuSer ProTy rTy rLys 
301 9^T^I^T^^5T§TT^^^^^^^^^^^^^^^CTGACTCTGTCACCATATTACA 
GCAGCACACCGCCACAACAAGAGCAGCCCAACTACCGCGACTGAGACAGTGGTATAATGT 

ArgTyrI leSerTrpCysLeuTrpTrpLeuGln 
361 AGCGCTATATCAGCTGGTGCTTGTGGTGGCTTCAGAA 
TCGCGATATAGTCGACCACGAACACCACCGAAGTCTT 

FIG. 25 

ProAlaProSerGlyCvsProP^ 
1 CCAGCCCCTTCTGGCTGCCCCCCCGACTCCGACGCTGAGTCCTATTCCTCCATGCCCCCC 
GGTCGGGGAAGACCGACGGGGGGGCTGAGGCTGCGACTCAGGATAAGGAGGTACGGGGGG 

LeuGluGlyGluProGlyAspProAsp^ 
61 aGGAGGGGGAGCCTGGGGATCCGGATCnAGCGACGGGTCATGGTCAACAGTCAGTAGT 
GACCTCCCCCTCGGACCCCTAGGCCTAGAATCGCTGCCCAGTACCAGTTGTCAGTCATCA 

Overlap with 33g 

GluAlaAsnAlaGluAspVaW^ 
121 GAGGCCAACGCGGAGGATGTCGTGTGCTGCTCAATGTCCTACTCTTGGACAGGCGCACTC 
CTCCGGTTGCGCCTCCTACAGCACACGACGAGTTACAGGATGAGAACCTGTCCGCGTGAG 

101 ValThrProCysAlaAlaGluGluGlnLysLeuP^ 

181 GTCACCCCGTGCGCCGCGGAAGAACAGAAACTGCCCATCAATGCACTGAGCAACTCGTTG 
CAGTGGGGCACGCGGCGCCTTCTTGTCTTTGACGGGTAGTTACGTGACTCGTTGAGCAAC 

LeuArgHisHisAsnLeuValTyrSerThrf^ 
241 CTACGTCACCACAATTTGGTGTATTCCACCACCTCACGCAGTGCTTGCCAAAGGCAGAAG 
GATGCAGTGGTGTTAAACCACATAAGGTGGTGGAGTGCGTCACGAACGGTTTCCGTCTTC 

™ , (rY?MTi? rPheAs P A rgLeuG 1 nva 1 LeuAspSe r H 1 sTy rG 1 nAspVa 1 LeuLysG 1 y 
301 AAAGTCACATTTGACAGACTGCAAGTTCTGGACAGCCATTACCAGGACGTACTCAAGGAG 
TTTCAGTGTAAACTGTCTGACGTTCAAGACCTGTCGGTAATGGTCCTGCATGAGTTCCTC 

^ , ValLvsAIaAl aA 1 aSe rLysVa 1 LysA 1 aAsnPhe 
361 GTTAAAGCAGCGGCGTCAAAAGTGAAGGCTAACTTC 
CAATTTCGTCGCCGCAGTTTTCACTTCCGATTGAAG 



GluTyrValValLeuLeuPheLeuLeuLeuAlaAspAlaArgValCysSerCysLeuTrp 
1 G6GAGTACGTCGTTCTCCTGTTCCTTCTGCTTGCAGACGCGCGCGTCTGCTCCTGCTTGT 
CCCTCATGCAGCAAGAGGACAAGGAAGACGAACGTCTGCGCGCGCAGACGAGGACGAACA 

MetMetLeuLeuI leSerGlnAlaGluAlaAlaLeuGluAsnLeuVal I leLeuAsnAla 
61 GGATGATGCTACTCATATCCCAAGCGGAGGCGGCTTTGGAGAACCTCGTAATACTTAATG 
CCTACTACGATGAGTATAGGGTTCGCCTCCGCCGAAACCTCTTGGAGCATTATGAATTAC 

A 1 aSe rLeu A 1 aG 1 yThrH i sG 1 y LeuVa 1 Se r PheLeuVa 1 PhePheCy sPhe A 1 aT rp 
121 CAGCATCCCTGGCCGGGACGCACGGTCTTGTATCCTTCCTCGTGTTCTTCTGCTTTGCAT 
GTCGTAGGGACCGGCCCTGCGTGCCAGAACATAGGAAGGAGCACAAGAAGACGAAACGTA 

TyrLeuLysGlyLysTrpValProGlyAlaValTyrThrPheTyrGlyMetTrpProLeu 
181 G6TATTTGAAGGGTAAGTGGGTGCCCGGAGCGGTCTACACCTTCTACGGGATGTGGCCTC 
CCATAAACTTCCCATTCACCCACGGGCCTCGCCAGATGTGGAAGATGCCCTACACCGGAG 

LeuLeuLeuLeuLeuA 1 aLeuProG 1 nA rgA 1 aTy r A 1 aLeuAspThrG 1 uVa 1 A 1 aA 1 a 
241 TCCTCCTGCTCCTGTTGGCGTTGCCCCAGCG6GCGTACGC6CTGGACACGGAGGTGGCCG 
AGGAGGACGAGGACAACCGCAACGGGGTCGCCCGCATGCGCGACCTGTGCCTCCACCGGC 

Se rCy sG 1 yG 1 y Va 1 Va 1 LeuVa 1 G 1 yLeuMe tA 1 aLeuTh rLeuSe rProTy rTy rLys 
301 CGTCGTGTGGCGGTGTTGTTCTCGTCGGGTTGATGGCGCTGACTCTGTCACCATATTACA 
GCAGCACACCGCCACAACAAGAGCAGCCCAACTACCGCGACTGAGACAGTGGTATAATGT 

ArgTyrlleSerTrpCysLeuTrpTrpLeuGlnTyrPheLeuThrArgValGluAlaGln 
361 AGCGCTATATCAGCTGGTGCTTGTGGTGGCTTCAGTATTTTCTGACCAGAGTGGAAGCGC 
TCGCGATATAGTCGACCACGAACACCACCGAAGTCATAAAA6ACTGGTCTCACCTTCGCG 

LeuHisValTrpIleProProLeuAsnValArgGlyGlyArgAspAlaVallleLeuLeu 
121 AACTGCACGTGTGGATTCCCCCCCTCAACGTCCGAGGGGGGCGCGACGCCGTCATCTTAC 
TTGACGTGCACACCTAAGGGGGGGAGTTGCAGGCTCCCCCCGCGCTGCGGCAGTAGAATG 

MetCysA laValH i sProThrLeuVa 1 PheAspI 1 eThrLysLeuLeuLeuA laVa 1 Phe 
481 TCATGTGTGCTGTACACCCGACTCTGGTATTTGACATCACCAAATTGCTGCTGGCCGTCT 
AGTACACACGACATGTGGGCTGAGACCATAAACTGTAGTGGTTTAACGACGACCGGCAGA 

G 1 y ProLeuTrp 1 1 eLeuG 1 nA 1 aSe r LeuLeuLysVa 1 ProTy rPheVa 1 A rgVa 1G 1 n 
5m TCGGACCCCTTTGGATTCTTCAAGCCAGTTTGCTTAAAGTACCCTACTTTGTGCGCGTCC 
AGCCTGGGGAAACCTAAGAAGTTCGGTCAAACGAATTTCATGGGATGAAACACGCGCAGG 

G 1 y LeuLeuArgPheCysA 1 aLeuA 1 aA rgLysMe 1 1 1 eG 1 yG 1 yH i sTy rVa lGlnMet 
601 AAGGCCTTCTCCGGTTCTGCGCGTTAGCGCGGAAGATGATCGGAGGCCATTACGTGCAAA 
TTCCGGAAGAGGCCAAGACGCGCAATCGCGCCTTCTACTAGCCTCCGGTAATGCACGTTT 

Va 1 1 1 e 1 1 eLys LeuG 1 y A 1 aLeuTh rG 1 yTh rTy rVa lTy r AsnH i sLeuTh rP roLeu 
661 TGGTCATCATTAAGTTAGGGGCGCTTACTGGCACCTATGTTTATAACCATCTCACTCCTC 
ACCAGTAGTAATTCAATCCCCGCGAATGACCGTGGATACAAATATTGGTAGAGTGAGGAG 

ArgAspTrpA 1 aH i sAsnG lyLeuA rgAspLeuA 1 aVa 1 A 1 aVa 1 G 1 uP roVa 1 Va 1 Phe 
721 TTCGGGACTGGGCGCACAACGGCTTGCGAGATCTGGCCGTGGCTGTAGAGCCAGTCGTCT 
AAGCCCTGACCCGCGTGTTGCCGAACGCTCTAGACC6GCACCGACATCTCGGTCAGCAGA 

Se rG 1 nMe tG 1 uThr Lys Leu 1 1 eTh rTrpG 1 yA 1 aAspTh rA 1 a A 1 aCysG 1 yAsp 1 1 e 
781 TCTCCCAAATGGAGACCAAGCTCATCACGTG6GGGGCAGATACCGCCGCGTGCGGTGACA 
AGAGGGTTTACCTCTGGTTCGAGTA6TGCACCCCCCGTCTATG6CG6CGCACGCCACTGT 

IleAsnGlyLeuProValSerAlaArgArgGlyArgGluIleLeuLeuGlyProAlaAsp 
Sm TCATCAACGGCTTGCCTGTTTCCGCCCGCAGGGGCCGGGAGATACTGCTCGGGCCAGCCG 
AGTAGTTGCCGAACGGACAAAGGCGGGCGTCCCCGGCCCTCTATGACGAGCCCGGTCGGC 

G 1 yMe tVa 1 Se r LysG 1 yTrpA rgLeuLeuA 1 aPro 1 1 eTh r A 1 aTy r A 1 aG 1 nG 1 nTh r 
901 ATGGAATGGTCTCCAAGGGGTGGAGGTTGCTGGCGCCCATCACGGCGTACGCCCAGCAGA 
TACCTTACCAGAGGTTCCCCACCTCCAACGACCGCGGGTAGTGCCGCATGCGGGTCGTCT 



FIG. 26B 

ArgGlyLeuLeuGlyCysIlelleThrSerLeuThrGlyArgAspLysAsnGlnValGlu 
961 CAAGGGGCCTCCTAGGGTGCATAATCACCAGCCTAACT6GCCGGGACAAAAACCAAGTGG 
GTTCCCCGG AGGATCCCA CGTATTAGTGGTCGGATTGACCGGCCCTGTTTTTGGTTCA CC 

GlyGluValGlnlleValSerThrAlaAlaGlnThrPheLeuAlaThrCysIleAsnGly 
1021 A6GGTGAGGTCCAGATTGTGTCAACTGCTGCCCAAACCTTCCTGGCAACGTGCATCAATG 
TCCCACTCCAGGTCTAACACAGTTGACGACGGGTTTGGAAGGACCGTTGCACGTAGTTAC 

Va 1 CysTrpTh rVa 1 Ty rH i sG 1 yA 1 aG 1 yTh r A rgTh r I leA 1 aSe rProLysG 1 y Pro 
1081 GGGTGTGCTGGACTGTCTACCACGGGGCCGGAACGAGGACCATCGCGTCACCCAAGGGTC 
CCCACACGACCTGACAGATGGTGCCCCGGCCTTGCTCCTGGTAGCGCAGTGGGTTCCCAG 

VallleGlnMetTyrThrAsnValAspGlnAspLeuValGlyTrpProAlaProGlnGly 
1141 CT6TCATCCAGATGTATACCAATGTAGACCAAGACCTTGTGGGCTGGCCCGCTCCGCAAG 
GACAGTAGGTCTACATATGGTTACATCTGGTTCTGGAACACCCGACCGGGCGAGGCGTTC 

SerArgSerLeuThrProCysThrCysGlySerSerAspLeuTyrLeuValThrArgHis 
1201 GTAGCCGCTCATTGACACCCTGCACTTGCGGCTCCTCGGACCTTTACCTGGTCACGAGGC 
CATCGGCGAGTAACTGTGGGACGTGAACGCCGAGGAGCCTGGAAATGGACCAGTGCTCCG 

A laAspVa 1 1 1 eProVa 1 ArgA rgArgG 1 yAspSerArgG lySerLeuLeuSerProArg 
1261 ACGCCGATGTCATTCCCGTGCGCCGGCGGGGTGATAGCAGGGGCAGCCTGCTGTCGCCCC 
TGCGGCTACAGTAAGGGCACGCGGCCGCCCCACTATCGTCCCCGTCGGACGACAGCGGGG 

ProIleSerTyrLeuLysGlySerSerGlyGlyProLeuLeuCysProAlaGlyHisAla 
1321 GGCCCATTTCCTACTTGAAAGGCTCCTCGGGGGGTCCGCTGTTGTGCCCCGCGGGGCACG 
CCGGGTAAAGGATGAACTTTCCGAGGAGCCCCCCAGGCGACAACACGGGGCGCCCCGTGC 

ValGlyllePheArgAlaAlaValCysThrArgGlyValAlaLysAlaValAspPhelle 
1381 CCGTGGGCATATTTAGGGCCGCGGTGTGCACCCGTGGAGTGGCTAAGGCGGTGGACTTTA 
GGCACCCGTATAAATCCCGGCGCGACACGTGGGCACCTCACCGATTCCGCCACCTGAAAT 

ProValGluAsnLeuGluThrThrMetArgSerProValPheThrAspAsnSerSerPro 
mm TCCCTGTGGAGAACCTAGAGACAACCATGAGGTCCCCGGTGTTCACGGATAACTCCTCTC 
AGGGACACCTCTTGGATCTCTGTTGGTACTCCAGGGGCCACAAGTGCCTATTGAGGAGAG 

P roVa 1 Va 1 P roG 1 nSe r PheG 1 nVa 1 A 1 aH i s LeuH i s A 1 aP roTh rG 1 y SerG 1 y Lys 
1501 CACCAGTAGTGCCCCAGAGCTTCCAGGTGGCTCACCTCCATGCTCCCACAGGCAGCGGCA 
GTGGTCATCACGGGGTCTCGAAGGTCCACCGAGTGGAGGTACGAGGGTGTCCGTCGCCGT 

Se rTh rLysVa 1 ProA I aA 1 aTy r A 1 aA 1 aG 1 nG 1 yTy rLysVa 1 LeuVa 1 LeuAsnPro 
1561 AAAGCACCAAGGTCCCGGCTGCATATGCAGCTCAGGGCTATAAGGTGCTAGTACTCAACC 
TTTCGTGGTTCCAGGGCCGACGTATACGTCGAGTCCCGATATTCCACGATCATGAGTTGG 

SerVa 1 A 1 aA 1 aThr LeuG 1 yPheG 1 yA 1 aTy rMetSe rLys A 1 aH i sG 1 y 1 1 eAspPro 
1621 CCTCTGTTGCTGCAACACTGGGCTTTGGTGCTTACATGTCCAAGGCTCATGGGATCGATC 
GGAGACAACGACGTTGTGACCCGAAACCACGAATGTACAGGTTCCGAGTACCCTAGCTAG 

Asn 1 1 e A rgTh rG 1 y Va 1 A rgTh r 1 1 eTh rTh rG 1 ySe r Pro 1 1 eTh rTy rSe rTh rTy r 
1681 CTAACATCAGGACCGGGGTGAGAACAATTACCACTGGCAGCCCCATCACGTACTCCACCT 
6ATTGTAGTCCTGGCCCCACTCTTGTTAATGGTGACCGTCGGGGTAGTGCATGAGGTGGA 

G 1 y LysPheLeuA laAspG 1 yG lyCysSerG 1 yG 1 yA 1 aTy rAsp 1 1 e I le 1 1 eCys Asp 
1741 ACGGCAAGTTCCTTGCCGACGGCGGGTGCTCGGGGGGCGCTTATGACATAATAATTTGTG 
TGCCGTTCAAGGAACGGCTGCCGCCCACGAGCCCCCCGCGAATACTGTATTATTAAACAC 

GluCysHisSerThrAspAlaThrSerlleLeuGlylleGlyThrValLeuAspGlnAla 
1801 ACGAGTGCCACTCCACGGATGCCACATCCATCTTGGGCATCGGCACTGTCCTTGACCAAG 
TGCTCACGGTGAGGTGCCTACGGTGTAGGTAGAACCCGTAGCCGTGACAGGAACTGGTTC 

G 1 uTh r A 1 aG 1 y A 1 a A rgLe uVa 1 Va 1 Le u A 1 aTh r A 1 aTh rP roP r oG 1 y Se rVa 1 Th r 
1861 CAGAGACTGCGGGGGCGAGACTGGTTGTGCTCGCCACCGCCACCCCTCCGGGCTCCGTCA 
GTCTCTGACGCCCCCGCTCTGACCAACACGAGCGGTGGCGGTGGGGAGGCCCGAGGCAGT 

Va 1 ProH i sPro Asn 1 1 eG 1 uG 1 uVa 1 A 1 aLeuSe rTh rTh rG 1 yG 1 u 1 1 e P roPheTy r 
1921 CTGTGCCCCATCCCAACATCGAGGAGGTTGCTCTGTCCACCACCGGAGAGATCCCTTTTT 
GACACGGGGTAGGGTTGTAGCTCCTCCAACGAGACAGGTGGTGGCCTCTCTAGGGAAAAA 




FIG. 26C 

, rt „, . 6 1 yLysA 1 a 1 1 eProLeuG luVall leLysGlyGlyArgHisLeul 1 ePheCysH I sSe r 
1981 ACGGCAAGGCTATCCCCCTCGAAGTAATCAAGGGGGGGAGACATCTCATCTTCTGTCATT 
TGCCGTTCCGATAGGGGGAGCTTCATTAGTTCCCCCCCTCTGTAGAGTAGAAGACAGTAA 

o«i , r AY§U§lrY^X^§R5 luLeuAlaAlaL ysLeuValAlaLeuGlyI leAsnAlaValAla 

20,1 wmmmmmmmmmmL 
2101 ^m^^i^wmimsmmm^ h 

GGATGATGGCGCCAGAACTGCACAGGCAGTAGGGCTGGTCGCCGCTACAACAGCAGCACC 

o nc , r ,TtJ^fSfti§l:§^S tThrG1 y T y rThrG1 y A spPheAspSerValIleAspCysAsnThr 
2161 CAACCGATGCCCTCATGACCGGCTATACCGGCGACTTCGACTCGGTGATAGACTGCAATA 
GTTGGCTACGGGAGTACTGGCCGATATGGCCGCTGAAGCTGAGCCACTATCTGACGTTAT 

n „ CysValThrGlnThrValAspPheSerLeuAspProThrPheThrlleGluThrlleThr 

2221 CGTGTGTCACCCAGACAGTCGATTTCAGCCm 

GCACACAGTGGGTCTGTCAGCTAAAGTCGGAACTGGGATGGAAGTGGTAACTCTGTTAGT 

- oon ^^LeuProGlnAspAlaValSerArgThrGlnArgArgGlyArgThrGlyArgGlyLysPro 
2281 CGCTCCCCCAGGATGCTGTCTCCCGCACTCAACGTCGGGGCAGGACTGGCAGGGGGAA^ 
GCGAGGGGGTCCTACGACAGAGGGCGTGAGTTGCAGCCCCGTCCTGACCGTCCCCCTTCG 

, , Glyl leTyrArgPheVa 1 A 1 aProG 1 yG 1 uArgProSe rG 1 yMetPhe AspSe rSe rVa 1 
2341 CAGGCATCTACAGATTTGTGGCACCGGGGGAGCGCCCCTCCGGCATGTTCGACTCGTCCG 
GTCCGTAGATGTCTAAACACCGTGGCCCCCTCGCGGGGAGGCCGTACAAGCTGAGCAGGC 

.... LeuCysGluCysTyrAspAlaGlyCysAlaTrpTyrt 
2401 TCCTCTGTGAGTGCTATGACGCAGGCTGTGCTTG 

AGGAGACACTCACGATACTGCGTCCGACACGAACCATACTCGAGTGCGGGCGGCTCTGAT 

yalArgLeuA rgA 1 aTy rMetAsnTh rProG 1 yLeuProVa 1 CysG 1 nAspH i sLeuG 1 u 
2461 CAGTTAGGCTACGAGCGTACATG^ U 
GTCAATCCGATGCTCGCATGTACTTGTGGGGCCCCGAAGGGCACACGGTCCTGGTAGAAC 

oco i . A P^TrpGlu61yValPheThr61yLeuThrH i s 1 1 eAspA laHl sPheLeuSe rG 1 nTh r 
2521 AATTTTGGGAGGGCGTCTTTACAGGCCTCACTCATATAGATGCCCACTTTCTATCCCAGA 
TTAAAACCCTCCCGCAGAAATGTCCGGAGTGAGTATATCTACGGGTGAAAGATAGGGTCT 

oco i r a k YJ 6 . 1 pSerGlyG 1 uAsnLeuP roTy r LeuVa 1 A 1 aTy rG 1 nA 1 aTh rVa 1 Cy sA 1 aA rg 
2581 CAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACCAAGCCACCGTGTGCGCTA 
GTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGCATGGTTCGGTGGCACACGCGAT 

o^, , ^^AlaGlnAlaProProProSerTrpAspGlnMetTrpLysCysLeuI leArgLeuLysPro 
2641 GGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAA^ 

CCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCACAAACTAAGCGGAGTTCG 

^TDC LeuHisG1 y proThrpr o L euLeuTyrArgLeuGlyAlaValGlnAsnGluIleThr 

2701 m^immmimmmmism 

2761 CcfefM 

GGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACGTACAGCCGGCTGGACCTCCAGC 

2821 TCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTTTGGCCGCGTATTGCCTGT 
AGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGACGAAACCGGCGCATAACGGACA 

0001 ^JDC^X^^^a^^ValGlyArgValValLeuSerGlyLysProAlallellePro 
2881 CAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTGTGCGGGAAGCCGGCAATCATAC 
GTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAACAGGCCCTTCGGCCGTTAGTATG 



FIG. 26D 

TyrlleGluGlnGI yMetMetLeuA 1 aG 1 uG 1 nPheLysG 1 nLy sA 1 aLeuG ly LeuLeu 
3001 CGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAAGCAGAAGGCCCTCGGCCTCC 
GCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAGTTCGTCTTCCGGGAGCCGGAGG 

GInThrAlaSerArgGlnAlaGluVallleAlaProAlaValGlnThrAsnTrpGlnLys 
3061 TGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTGTCCAGACCAACTGGCAAA 
ACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGACGACAGGTCTGGTTGACCGTTT 

LeuGluThrPheTrpAlaLysHisMetTrpAsnPhelleSerGlylleGlnTyrLeuAla 
3121 AACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCAGTGGGATACAATACTTGG 
TTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGTCACCCTATGTTATGAACC 

„, ft , ^Gl yLeuSerTh rLeuProG 1 yAsnProA 1 a I leA 1 aSe rLeuMet A laPheThrA 1 aA 1 a 
3181 CGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGCTTCATTGATGGCTTTTACAGCTG 
GCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTAACTACCGAAAATGTCGAC 

ValThrSerProLeuThrThrSerGlnThrLeuLeuPheAsnlleLeuGlyGlyTrpVal 
3241 CTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCTTCAACATATTGGGGGGGTGGG 
GACAGTGGTCGGGTGATT6GTGATCGGTTTGGGAGGAGAAGTTGTATAACCCCCCCACCC 

330 1 TGGCT GCCCAGCT CGCCGCCCCCG^ 1 V 

ACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAACACCCGCGACCGAATCGAC 

„ AlaAlalleGlySerValGlyLeuGlyLysValLeuIleAspIleLeuAlaGlyTyrGly 
3361 GCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAGACATCCTTGCAGGGTATG 
CGCGGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATCTGTAGGAACGTCCCATAC 

„ o -, ^ AlaGlyya 1 A 1 aG lyA 1 aLeuVa 1 A 1 aPheLys 1 1 eMe tSe rG 1 yG 1 uVa 1 ProSe rThr 
3421 GCGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAGATCATGAGCGGTGAGGTCCCCTCCA 
CGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACTCGCCACTCCAGGGGAGGT 

GluAspLeuValAsnLeuLeuProAlal leLeuSerProGlyAlaLeuValValGlyval 
3481 CGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCGGAGCCCTCGTAGTCGGCG 
GCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGCCTCGGGAGCATCAGCCGC 

, ^ValCysAlaAlal leLeuArgArgHisValGlyProGlyGluGlyAlaValGlnTrpMet 
3541 TGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCGAGGGGGCAGTGCAGTGGA 
ACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGCTCCCCCGTCACGTCACCT 

AsnArgLeu 1 1 eA 1 aPheA 1 aSe rA rgG 1 yAsnHi sVa 1 SerProThrH i sTy rVa 1 Pro 
3601 TGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCCCACGCACTACGTGC 
ACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGGGTGCGTGATGCACG 

GluSerAspAlaAlaAlaArgValThrAlalleLeuSerSerLeuThrValThrGlnLeu 
3661 CGGAGAGCGATGCAGCTGCCCGCGTCACTGCCATACTCAGCAGCCTCACTGTAACCCAGC 
GCCTCTCGCTACGTCGACGGGCGCAGTGACGGTATGAGTCGTCGGAGTGACATTGGGTCG 

T ^LeuArgArgLeuHisGlnTrpIleSerSerGluCysThrThrProCysSerGlySerTrp 
3721 TCCTGAGGCGACTGCACCAGTGGATAAGCTCGGAGTGTACCACTCCATGCTCCGGTTCCT 
AGGACTCCGCTGACGTGGTCACCTATTCGAGCCTCACATGGTGAGGTACGAGGCCAAGGA 

LeuArgAspI leTrpAsPTrpI leCysGluValLeuSerAspPheLysThrTrpLeuLys 
3781 GGCTAAGGGACATCTGGGACTGGATATGCGAGGTGTTGAGCGACTTTAAGACCTGGCTAA 
CCGATTCCCTGTAGACCCTGACCTATACGCTCCACAACTCGCTGAAATTCTGGACCGATT 

A 1 aLysLeuMetProG 1 nLeuProG 1 y 1 1 eProPheVa 1 SerCysG 1 nA rgG 1 yTy rLys 
3841 AAGCTAAGCTCATGCCACAGCTGCCTGGGATCCCCTTTGTGTCCTGCCAGCGCGGGTATA 
TTCGATTCGAGTACGGTGTCGACGGACCCTAGGGGAAACACAGGACGGTCGCGCCCATAT 

GlyValTrpArgValAspGlylleMetHisThrArgCysHisCysGlyAlaGluIleThr 
3901 AGGGGGTCTGGCGAGTGGACGGCATCATGCACACTCGCTGCCACTGTGGAGCTGAGATCA 
TCCCCCAGACCGCTCACCTGCCGTAGTACGTGTGAGCGACGGTGACACCTCGACTCTAGT 




FIG. 26E 

SerG lyThrPhePro 1 1 eAsnA 1 aTyrThrThrG lyProCysThrPrpLeuProAlaPro 
4021 GGAGTGGGACCTTCCCCATTAATGCCTACACCACGGGCCCCTGTACCCCCCTTCCTGCGC 
CCTCACCCTGGAAGGGGTAATTACGGATGTGGTGCCCGGGGACATGGGGGGAAGGACGCG 

AsnTy rThrPheA 1 aLeuTrpA rgVa 1 SerA laG 1 uG 1 uTyrValG 1 u 1 1 eA rgG 1 nVa 1 
4081 CGAACTACACGTTCGCGCTATGGAGGGTGTCTGCAGAGGAATATGTGGAG^ 

GCTTGATGTGCAAGCGCGATACCTCCCACAGACGTCTCCTTATACACCTCTATTCCGTCC 

GlyAspPheH i sTy rVa ITh rG 1 yMetThrThr AspAsnLeuLysCy sP roCysG 1 nVa 1 
4141 TGGGGGACTTCCACTACGTGACGGGTATGACTACTGACAATCTCAAATGCCCGTGCCAGG 
ACCCCCTGAAGGTGATGCACTGCCCATACTGATGACTGTTAGAGTTTACGGGCACGGTCC 

ProSerProGluPhePheThrGluLeuAspGlyValArgLeuHisArgPheAla^ 
4201 TCCCATCGCCCGAATTTTTCACAGAATTGGACGGGGTGCGCCTACATAGGTTTGCGCa 
AGGGTAGCGGGCTTAAAAAGTGTCTTAACCTGCCCCACGCGGATGTATCCAAACGCGGGG 

CysLysProLeuLeuA rgG 1 uG 1 uVa 1 Se r Phe A rgVa 1 G 1 y LeuH i sG 1 uTy rP rpVa 1 
1261 CCTGCAAGCCCTTGCTGCGGGAGGAGGTATCATTCAGAGTAGGACTCCACGAATACCCGG 
GGACGTTCGGGAACGACGCCCTCCTCCATAGTAAGTCTCATCCTGAGGTGCTTATGGGCC 

G lySerG 1 nLeuProCysG 1 uProG 1 uProAspva 1 A 1 aVa 1 LeuThrSerMetLeuThr 
1321 TAGGGTCGCAATTACCTTGCGAGCCCGAACCGGACGTGGCCGTGTTGACGTCCATGCT^ 
ATCCCAGCGTTAATGGAACGCTCGGGCTTGGCCTGCACCGGCACAACTGCAGGTACGAGT 

AspProSe rH i s 1 1 eTh rA 1 aG 1 uA 1 a A 1 aG 1 V A rg ArgLeuAl aA rgGlySe r P roPro 
4381 CTGATCCCTCCCATATAACAGCAGAGGCGGCCGGGCGAAGGTTGGCGAGGG^ 

GACTAGGGAGGGTATATTGTCGTCTCCGCCGGCCCGCTTCCAACCGCTCCCCTAGTGGGG 

SerValAlaSerSerSerAlaSerGlnLeuSerAlaProSerLeuLysAl^ 
4441 CCTCTGTGGCCAGCTCCTCGGCTAGCCAGCTATCCGCTCCATCTC^ 

GGAGACACCGGTCGAGGAGCCGATCGGTCGATAGGCGAGGTAGAGAGTTCCGTTGAACGT 

AlaAsnHisAspSerProAspAlaGluLeuI leGluAlaAsnLeuLeuTrpArgGlnGlu 
4501 CCGCTAACCATGACTCCCCTGATGCTGAGCTCATAGAGGCCAACCTCCTATGGAGGCA^ 
GGCGATTGGTACTGAGGGGACTACGACTCGAGTATCTCCGGTTGGAGGATACCTCCGTCC 

MetGlyGlyAsnlleThrArgValGluSerGluAsnLys^ 
4561 AGATGGGCGGCAACATCACCAGGGTTGAGTCAGAAAACAAAGTGGTGAm 

TCTACCCGCCGTTGTAGTGGTCCCAACTCAGTCTTTTGTTTCACCACTAAGACCTGAGGA 

AspProLeuVa 1 A 1 aG 1 uG 1 uAs pG 1 u A rgG 1 ul leSerVa 1 PrpA 1 aG 1 u 1 1 eLeuA rg 
4621 TCGATCCGCTTGTGGGGGAGGAGGACGAGCG6GAGATCTCCGTACCCGCAGAAATCCT 
AGCTAGGCGAACACCGCCTCCTCCTGCTCGCCCTCTAGAGGCATGGGCGTCTTTAGGACG 

LysSerArgArgPheA 1 aG 1 nA 1 aLeuProVa lTrpA laA rgProAspTy r AsnProPro 
4681 GGAAGTCTCGGAGATTCGCCCAGGCCCTGCCCGTTTGGGCGCGG^ 

CCTTCAGAGCCTCTAAGCGGGTCCGGGACGGGCAAACCCGCGCCGGCCTGATATTGGGGG 

LeuVal G 1 uTh rTrpLysLysProAspTyrG 1 uProProVa 1 Va 1 H i sGl yCysProLeu 
4741 CGCTAGTGGAGACGTGGAAAAAGCCCGACTACGAACCACCTGTG^ 

GCGATCACCTCTGCACCTTTTTCGGGCTGATGCTTGGTGGACACCAGGTACCGACAGGCG 

ProProProLysSerProProValProProProArgLysLysArgT^ 
4801 TTCCACCTCCAAAGTCCCCTCCTGT6CCTCCGCCTCGGAAGAAGCGGACGGTG^ 

AAGGTGGAGGTTTCAGGGGAGGACACGGAGGCGGAGCCTTCTTCGCCTGCCACCAGGAGT 

G luSerTh rLeuSe rThrA 1 aLeuA 1 aG 1 uLeuA 1 aTh rA rgSe rPheG 1 ySe rSerSer 
4861 CTGAATCAACCCTATCTACTGCCTTGGCCGAGCTCGCCACCAGAAGCTTTG^ 

GACTTAGTTGGGATAGATGACGGAACCGGCTCGAGCGGTGGTCTTCGAAACCGTCGAGGA 

ThrSerGlylleThrGlyAspAsnThrThrThrSerSerGlu^^ 
4921 CAACTTCCGGCATTACGGGCGACAATACGACAACATCCTCTGAGCCCGCCCCTTCT^ 

GTTGAAGGCCGTAATGCCCGCTGTTATGCTGTTGTAGGAGACTCGGGCGGGGAAGACCGA 

ProProAspSe rAspA 1 aG 1 uSe rTy rSerSe rMetProProLeuG JuGlyGl uProG 1 y 
4981 GCCCCCCCGACTCCGACGCTGAGTCCTATTCCTCCATGCCCCCCCTGGAGGGGM 

CGGGGGG6CTGAGGCTGCGACTCAGGATAAGGAGGTACGGGGGGGACCTCCCCCTCGGAC 




Tig. 26F 

AspProAspLeuSe r AspG lySerTrpSerThrVa 1 SerSerG 1 uAl aAsnA 1 aG 1 uAsp 
5041 GGGATCCGGATCTTAGCGACGGGTCATGGTCAACGGTCA^ 

CCCTAGGCCTAGAATCGCTGCCCAGTACCAGTTGCCAGTCATCACTCCGGTTGCGCCTCC 

Va 1 Va 1 CysCy s Se rMe tSe rTy rSerTrpTh rG 1 yA 1 aLeuVa 1 Th rP roCysAlaAla 
5101 ATGTCGTGTGCTGCTCAATGTCTTACTCTTGGACAGGCGCACTC^ 

TACAGCACACGACGAGTTACAGAATGAGAACCTGTCCGCGTGAGCAGTGGGGCACGCGGC 

GluGluGlnLysLeuProIleAsnAlaLeuSerAsnSerLeuLeuArgHisHisAsnLeu 

5161 CGGAAGAACAGAAACTGCCCATCAATGCACTAAGCAACTCCT^ 

GCCTTCTTGTCTTTGACGGGTAGTTAC6TGATTCGTTGAGCAACGATGCAGTGGTGTTAA 

ValTyrSerThrThrSerArgSerAlaCysGlnArgGlnLysLysValThrPhe^ 
5221 TGGTGTATTCCACCACCTCACGCAGTGCTTGCCAAA 

ACCACATAAGGTGGTGGAGTGCGTCACGAACGGTTTCCGTCTTCTTTCAGTGTAAACTGT 

LeuG 1 n Va 1 Leu AspSe rH i sTy rG 1 n AspVa 1 LeuLy sG 1 uVa 1 Ly sA 1 aA 1 aA 1 aSe r 
5281 GACTGCAAGTTCTG6ACAGCCATTACCAGGACGTACTCAAGGAGGTTAAAGCAGCGG 

CTGACGTTCAAGACCTGTCGGTAATGGTCCTGCATGAGTTCCTCCAATTTCGTCGCCGCA 

LysValLysAlaAsnLeu 
5341 CAAAAGTGAAGGCTAACTTG 
GTTTTCACTTCCGATTGAAC 



FIG. 30 

GlyGlyGluAsnCysGlyTyrArgArgCysArgAlaSerGlyValL 
1 GGGGGGGAGAACTGCGGCTATCGCAGGTGCCGCGCAAGCGGCGTACTGACAAC^ 

CCCCCCCTCTTGAC6CCGATAGCGTCCACGGCGCGTTC6CCGCATGACTGTTGATCGACA 



61 



181 



GlyAsnThrLeuThrCysTyrlleLysAlaArgAlaAlaCysArgAlaMaGlyLeuGln 
GGTAACACCCTCACTTGTTACATCAAGGCCCGAGCAGCCTGTCGAGCCGCAGGGCTC^ 
CCATTGTGGGAGTGAACAATGTAGTTCCGGGCTCGTCGGACAGCTCGGCGTCCCGAGGTC 

Over 1 3p with 19g 

AspCysThrMetLeuValCysGlyAspAspLeuValVain 
121 GACTGCACCATGCTCGTGTGTGGCGACGACTTAGTCGTTATCTGTGA^ 

CTGACGTGGTACGAGCACACACCGCTGCTGAATCAGCAATAGACACTTTCGCGCCCCCAG 

GlnG 1 uAspA 1 aA 1 aSerLeuArgA 1 aPheThrG 1 uA 1 aMetThrArgTyrSerAlaPro 
CAGGAGGACGCGGCGAGCCTGAGAGCCTTCACGGAGGCTATGACCAGGTACTCCGCCCCC 
GTCCTCCTGCGCCGCTCGGACTCTCGGAAGTGCCTCCGATACTGGTCCATGAGGCGGGGG 

ProGlyAspProProGlnProGluTyrAspLeuGluLeuIleThrSerCysSerSerAsn 
241 CCTGGGGACCCCCCACAACCAGAATACGACTTGGAGCTCATAACATCATGCT^ 

G6ACCCCTGGGGGGTGTTGGTCTTATGCTGAACCTCGAGTATTGTAGTACGAGGAGGTTG 

ValSerValAlaHisAspGlyAlaGlyLysArgValTyrTyrLeuThrArgAsp^ 
301 GTGTCAGTCGCCCACGACGGCGCTGGAAAGAGGGTCTACTACCTCACCCGT 

CACAGTCAGCGGGTGCTGCCGCGACCTTTCTCCCAGATGATGGAGTGGGCACTGGGATGT 

ThrProLeuA 1 aArgA 1 aA 1 aTr pG luThr A 1 aA raH i sThrProVa 1 AsnSe rTrpL eu 
361 ACCCCCCTCGCGAGAGCTGCGTGGGAGACAGCAAGACACACTCa 

TGGGGG6AGCGCTCTCGACGCACCCTCT6TCGTTCTGTGTGAGGTCAGTTAAGGACCGAT 

GlyAsnl lei leMetPheAlaProThrLeuTrpAla 
421 GGCAACATAATCATGTTTGCCCCCACACTGTGGGCG 
CCGTTGTATTAGTACAAACGGGGGTGT6ACACCCGC 
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FIG. 28 

Overlap with 39c 

LeuLysGluValLysAlaAlaAlaSerLysValLysAlaAsnLeuLeuSerValGluGlu 
1 TGCTCAAGGAGGTTAAAGCAGCGGCGTCAAAAGTGAAGGCTAACTTGCTATCCGTAGAGG 
ACGAGTTCCTCCAATTTCGTCGCCGCAGTTTTCACTTCCGATTGAACGATAGGCATCTCC 

AlaCysSerLeuThrProProHisSerAlaLysSerLysPheGlyTyrGlyAlaLysAsp 
6 1 AAGCTTGCAGCCTGACGCCCCCACACTCAGCCAAATCCAAGTTTGGTTATGGGGCAAAAG 
TTCGAACGTCGGACTGCGGGGGTGTGAGTCGGTTTAGGTTCAAACCAATACCCCGTTTTC 

ValArgCysHisAlaArgLysAlaValThrHisIleAsnSerValTrpLysAspLeuLeu 
121 ACGTCCGTTGCCATGCCAGAAAGGCCGTAACCCACATCAACTCCGTGTGGAAAG ACCTTC 
TGCAGGCAACGGTACGGTCTTTCCGGCATTGGGTGTAGTTGAGGCACACCTTTCTGGAAG 

GluAspAsnValThrProIleAspThrThrlleMetAlaLysAsnGluValPheCysVal 
181 TGG AAGACAATGTAACACCAATAGACACTACCATCATGGCTAAGAACGAGGTTTTCTGCG 
ACCTTCTGTTACATTGTGGTTATCTGTGATGGTAGTACCGATTCTTGCTCCAAAAGACGC 

GlnProGluLysGlyGlyArgLysProAlaArgLeuIleValPheProAspLeuGlyVal 
241 TTCAGCCTGAGAAGGGGGGTCGTAAGCCAGCTCGTCTCATCGTGTTCCCCGATCTGGGCG 
AAGTCGGACTCTTCCCCCCAGCATTCGGTCGAGCAGAGTAGCACAAGGGGCTAGACCCGC 

ArgValCysGluLysMetAlaLeuTyrAspValValThrLysLeuProLeuAlaValMet 
301 TGCGCGTGTGCGAAAAGATGGCTTTGTACG ACGTGGTTACAAAGCTCCCCTTGGCCGTG A 
ACGCGCACACGCTTTTCTACCGAAACATGCTGCACCAATGTTTCGAGGGGAACCGGCACT 

GlySerSerTyrGlyPheGlnTyrSerProGlyGlnArgValGluPheLeuValGlnAla 
361 TGGGAAGCTCCTACGGATTCCAATACTCACCAGGACAGCGGGTTGAATTCCTCGTGCAAG 
ACCCTTCGAGGATGCCTAAGGTTATGAGTGGTCCTGTCGCCCAACTTAAGGAGCACGTTC 

TrpLysSerLysLysThrProMetGlyPheSerTyrAspThrArgCysPheAspSerThr 
421 CGTGGAAGTCCAAGAAAACCCCAATGGGGTTCTCGTATGATACCCGCTGCTTTGACTCCA 
GCACCTTCAGGTTCTTTTGGGGTTACCCCAAGAGCATACTATGGGCGACGAAACTGAGGT 

ValThrGluSerAspIleArgThrGluGluAla 
4 81 CAGTCACTGAGAGCGACATCCGTACGGAGG AGGCA 
GTCAGTGACTCTCGCTGTAGGCATGCCTCCTCCGT 



FIG. 29 



GluPheLeuValGlnAlaTrpLysSerLysLysThrProMetGlyPheSerTyrAspThr 
1 GAATTCCTCGTGCAAGCGTGGAAGTCCAAGAAAACCCCAATGGGGTTCTCGTATGATACC 
CTTAAGGAGCACGTTCGCACCTTCAGGTTCTTTTGGGGTTACCCCAAGAGCATACTATGC 

Overlap with 35f 

ArgCysPheAspSerThrValThrGluSerAspIleArgThrGluGluAlalleTyrGln 
61 CGCTGCTTTGACTCCACAGTCACTGAGAGCGACATCCGTACGGAGGAGGCAATCTACCAA 
GCGACGAAACTGAGGTGTCAGTGACTCTCGCTGTAGGCATGCCTCCTCCGTTAGATGGTT 

CysCysAspLeuAspProGlnAlaArgValAlalleLysSerLeuThrGluArgLeuTyr 
121 TGTTGTGACCTCGACCCCCAAGCCCGCGTGGCCATCAAGTCCCTCACCGAGAGGCTTTAT 
ACAACACTGGAGCTGGGGGTTCGGGCGCACCGGTAGTTCAGGGAGTGGCTCTCCGAAATA 

ValGlyGlyProLeuThrAsnSerArgGlyGluAsnCysGlyTyrArgArgCysArgAla 
1 8 1 GTTGGGGGCCCTCTTACCAATTCAAGGGGGGAGAACTGCGGCTATCGCAGGTGCCGCGCG 
CAACCCCCGGGAGAATGGTTAAGTTCCCCCCTCTTGACGCCGATAGCGTCCACGGCGCGC 

SerGlyValLeuThrThrSerCysGlyAsnThrLeuThrCysTyrlleLysAlaArgAla 
241 AGCGGCGTACTGACAACTAGCTGTGGTAACACCCTCACTTGCTACATCAAGGCCCGGGC A 
TCGCCGCATGACTGTTGATCGACACCATTGTGGGAGTGAACGATGTAGTTCCGGGCCCGT 

AlaCysArgAlaAlaGlyLeuGlnAspCysThrMetLeuValCysGlyAspAspLeuVal 
301 GCCTGTCGAGCCGCAGGGCTCCAGGACTGCACCATGCTCGTGTGTGGCGACGACTTAGTC 
CGGACAGCTCGGCGTCCCGAGGTCCTGACGTGGTACGAGCACACACCGCTGCTGAATGAG 

VallleCysGluSerAlaGlyValGlnGluAspAlaAla 
361 GTTATCTGTGAAAGCGCGGGGGTCCAGGAGGACGCGGCGAG 
CAATAGACACTTTCGCGCCCCCAGGTCCTCCTGCGCCGCTC 
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FIG. 32A 

1 1 ePheLy s 1 1 e A rgMe tTy rVa 1 6 lyG 1 y Va 1 6 1 uH i sArgLeuG 1 uA 1 aA 1 aCy s A sn 
1 CCATATTTAAAATCAGGATGTACGTGGGAGGGGTCGAACACAGGCTGGAAGCTGCCTGCA 
GGTATAAATTTTAGTCCTACATGCACCCTCCCCAGCTTGTGTCCGACCTTCGACGGACGT 

TrpThrArgGlyGluArgCysAspLeuGluAspArgAspArgSerGluLeuSerProLeu 
61 ACTGGACGCGGGGCGAACGTTGCGATCTGGAAGACAGGGACAGGTCCGAGCTCAGCCCGT 
TGACCTGCGCCCCGCTTGCAACGCTAGACCTTCTGTCCCTGTCCAGGCTCGAGTCGGGCA 

LeuLeuTh rTh rTh rG 1 nTrpG 1 nVa 1 LeuProCysSe r PheTh rTh rLeuProA 1 aLeu 
121 TACTGCTGACCACTACACAGTGGCAGGTCCTCCCGTGTTCCTTCACAACCCTACCAGCCT 
ATGACGACTGGTGATGTGTCACCGTCCAGGAGGGCACAAGGAAGTGTTGGGATGGTCGGA 

SerThrGlyLeuI leHisLeuHisGlnAsnl leValAspValGlnTyrLeuTyrGlyVal 
181 TGTCCACCGGCCTCATCCACCTCCACCAGAACATTGTGGACGTGCAGTACTTGTACGGGG 
ACAGGTGGCCGGAGTAGGTGGAGGTGGTCTTGTAACACCTGCACGTCATGAACATGCCCC 

GlySerSerlleAlaSerTrpAlalleLysTrpGluTyrValValLeuLeuPheLeuLeu 
241 TGGGGTCAAGCATCGCGTCCTGGGCCATTAAGTGGGAGTACGTCGTTCTCCTGTTCCTTC 
ACCCCAGTTCGTAGCGCAGGACCCGGTAATTCACCCTCATGCAGCAAGAGGACAAGGAAG 

LeuA 1 aAspA 1 aA rgva 1 CysSe r CysLeuTrpMetMetLeuLeu 1 1 eSe rG 1 nA 1 aG 1 u 
301 TGCTTGCAGACGCGCGCGTCTGCTCCTGCTTGTGGATGATGCTACTCATATCCCAAGCGG 
ACGAACGTCTGCGCGCGCAGACGAGGACGAACACCTACTACGATGAGTATAGGGTTCGCC 

AlaAlaLeuGluAsnLeuVallleLeuAsnAlaAlaSerLeuAlaGlyThrHisGlyLeu 
361 AGGCGGCTTTGGAGAACCTCGTAATACTTAATGCAGCATCCCTGGCCGGGACGCACGGTC 
TCCGCCGAAACCTCTTGGAGCATTATGAATTACGTCGTAGGGACCGGCCCTGCGTGCCAG 

ValSerPheLeuValPhePheCysPheAlaTrpTyrLeuLysGlyLysTrpValProGly 
421 TTGTATCCTTCCTCGTGTTCTTCTGCTTTGCATGGTATTTGAAGG6TAAGTGGGTGCCCG 
AACATAGGAAGGAGCACAAGAAGACGAAACGTACCATAAACTTCCCATTCACCCACGGGC 

A 1 aVa lTy rThrPheTy rG 1 yMe tTrpProLeuLeuLeuLeuLeuLeuA 1 aLeuProG 1 n 
481 GAGCGGTCTACACCTTCTACGGGATGTGGCCTCTCCTCCTGCTCCTGTTGGCGTTGCCCC 
CTCGCCAGATGTGGAAGATGCCCTACACCGGAGAGGAGGACGAGGACAACCGCAACGGGG 

A rgA 1 aTy rA 1 aLeuAspTh rG 1 uVa 1 A 1 aA 1 aSe rCysG lyG 1 yVa 1 Va 1 LeuVa 1 G 1 y 
541 AGCGGGCGTACGCGCTGGACACGGAGGTGGCCGCGTCGTGTGGCGGTGTTGTTCTCGTCG 
TCGCCCGCATGCGCGACCTGTGCCTCCACCGGCGCAGCACACCGCCACAACAAGAGCAGC 

LeuMetAlaLeuThrLeuSerProTyrTyrLysArgTyrlleSerTrpCysLeuTrpTrp 
601 GGTTGATGGCGCTGACTCTGTCACCATATTACAAGCGCTATATCAGCTGGTGCTTGTGGT 
CCAACTACCGCGACTGAGACAGTGGTATAATGTTCGCGATATAGTCGACCACGAACACCA 

LeuGlnTyrPheLeuThrArgValGluAlaGlnLeuHisValTrpI leProProLeuAsn 
661 GGCTTCAGTATTTTCTGACCAGAGTGGAAGCGCAACTGCACGTGTGGATTCCCCCCCTCA 
CCGAAGTCATAAAAGACTGGTCTCACCTTCGCGTTGACGTGCACACCTAAGGGGGGGAGT 

Va 1 ArgG lyG 1 yArgAspA 1 aVa 1 1 leLeuLeuMetCysA laVa 1 H i sProThrLeuVa 1 
721 ACGTCCGAGGGGGGCGC6ACGCCGTCATCTTACTCATGTGTGCTGTACACCCGACTCTGG 
TGCAGGCTCCCCCCGCGCTGCGGCA6TAGAATGAGTACACACGACATGTGGGCTGAGACC 

PheAspI leThrLysLeuLeuLeuAlaValPheGlyProLeuTrpIleLeuGlnA laser 
781 TATTTGACATCACCAAATTGCTGCTGGCCGTCTTCGGACCCCTTTGGATTCTTCAAGCCA 
ATAAACTGTAGTGGTTTAACGACGACCGGCAGAAGCCTGGGGAAACCTAAGAAGTTCGGT 

ni LeuLeuLysValProTyrPheValArgValGlnGlyLeuLeuArgPheCysAlaLeuAla 
841 GTTTGCTTAAAGTACCCTACTTTGTGCGCGTCCAAGGCCTTCTCCGGTTCTGCGCGTTAG 
CAAACGAATTTCATGGGATGAAACACGCGCAGGTTCCGGAAGAGGCCAAGACGCGCAATC 



FIG. 32B 

A rgLy sMe 1 1 1 eG 1 y6 1 y H 1 sTy rVa 1 6 1 nMe tVa 1 1 1 e 1 1 eLvsLeuG 1 vA i ai pi iTh r 
901 CGCGGAAGATGATCGGAGGCCATTAC 

GCGCCTTCTACTAGCCTCCGGTAATGCACGTTTACCAGTAGTAATTCAATCCCCGCGAAT 

^,5I^T h C T y rValT y rA snHisLeuThrProLeuArgAspTrpAlaHisAsnGlyLeuAra 
961 CTGGCACCTATGTTTATAACCATCTCACTCCTCTTCGGGACTGG^ 

GACCGTGGATACAAATATTGGTAGAGTGAGGAGAAGCCCTGACCCGCGTGTTGCCGAACG 

„ AspLeuA 1 aVa 1 A 1 aVa 1 G 1 uProVa 1 Va 1 PheSe rG 1 nMe tG 1 uThr LysLeu 1 1 eThr 

1021 GAGATCTGGCCGTGGCTGTAGAGCCAGTCGTCTTCTC^ 

CTCTAGACCGGCACCGACATCTCGGTCAGCAGAAGAGGGTTTACCTCTGGTK 

, no , ^^I/TRGlyAlaAspThrA 1 aA 1 aCysG lyAsp I lei leAsnG 1 yLeuProVa lSerAl aArg 

1081 mm^immmmmmnm 

11«G«M^ 

CGTCCCCGGCCCTCTATGACGAGCCCGGTCGGCTACCTTACCAGAGGTTCCCCACCTCCA 

LeuAlaProHeThrAlaTyrAlaGlnGln^ 
1201 TGCTGGCGCCCATCACGGCGTACGCCCAGCAGACAAGGGGCCTCCTAGGGTfcATAATCA 
ACGACCGCGGGTAGTGCCGCATGCGGGTCGTCTGTTCCCCGGAGGATCCCACGTATTAGT 

i oc i m §^^LeuThrGl yA rgAspLy s AsnG 1 nVa 1 G 1 uG 1 yG 1 uVa 1 G 1 n 1 1 eVa 1 Se rThrA 1 a 
1261 CCAGCCTAACTGGCCGGGACAAAAACCAAGTGGAGGGTGAGGTCCAGATTGTGTCAACTG 
GGTCGGATTGACCGGCCCTGTTTTTGGTTCACCTCCCACTCCAGGTCTAACACAGTTGAC 

1 7o i r-r&Q I nThrPheLeuAlaThrCys 1 1 eAsnG 1 y Va 1 CysTrpThrVa 1 Ty rH i sG 1 yA 1 a 
1321 CTGCCCAAACCTTCCTGGCAACGTGCATCAATGGGGTGTGCTGGACTGTCTACCACGGGG 
GACGGGTTTGGAAGGACCGTTGCACGTAGTTACCCCACACGACCTGACAGATGGTGCCCC 

, 701 ^GlyThrArgThrl leAlaSerProLysGlyProVal I leGlnMetTyrThrAsnValAsp 
1381 CCGGAACGAGGACCATCGCGTCACCCA 

GGCCTTGCTCCTGGTAGCGCAGTGGGTTCCCAGGACAGTAGGTCTACATATGGTTACATC 

GlnAspLeuVaJ^WTrpP™^ 
1W1 ACCAAGACCTTGTGGGCTGGCCCGCTCCGCAAGGTAGCCGCTCATTGACACCCTGCAcW 
TGGTTCTGGAACACCCGACCGGGCGAGGCGTTCCATCGGCGAGTAACTGTGGGACGTGAA 

. Crtl „^G 1 ySe rSerAspLeuTyrLeuVa 1 ThrArgH i sA 1 aAspVa 1 1 1 eProVa 1 ArgArgArg 
1501 GCGGCTCCTCGGACCTTTACCTGGTCACGAGGCACGCCGATGTCATTCCCGTGCGCCGGC 
CGCCGAGGAGCCTGGAAATGGACCAGTGCTCCGTGCGGCTACAGTAAGGGCACGCGGCCG 

,. C1 ^GlyAspSerArgGlySerLeuLeuSerProArgProIleSerTyrLeuLy 
1561 GGGGTGATAGCAGGGGCAGCCTGCTGTCGCCCCGGCCCATTTCCTkn^ 

CCCCACTATCGTCCCCGTCGGACGACAGCTGGGCCGGGTAAAGGATGAACTTTCCGAGGA 

1M1 G 1 yG 1 yProLeuLeuCysProA 1 aG 1 yH i s A 1 a Va 1 G 1 y 1 1 ePheArgA 1 a A 1 aVa 1 Cy s 
1621 CGGGGGGTCCGCTGTTGTGCCCCGCGGGGCACGCCGTGGGCATATm 

GCCCCCCAGGCGACAACACGGGGCGCCCCGTGCGGCACCCGTATAAATCCCGGCGCCACA 

lcol rgG 1 yVa 1 A 1 aLy sA 1 a Va 1 AspPhe 1 1 eP roVa 1 G 1 uAsnLeuG 1 uTh rTh rMe t 

1681 GCACCCGTGGAGTGGCTAAGGCGGTGGACTTTATCCCTGTGGAGAACCTAGAGACAACCA 
CGTGGGCACCTCACCGATTCCGCCACCTGAAATAGGGACACCTCTTGGATCTCTGTTGGT 



FIG. 32C 

, -,. , ^ ArgSerProVa 1 PheThrAspAsnSerSe rProProVa 1 Va 1 ProG 1 nSe rPheG 1 nVa 1 
mi TGAGGTCCCCGGTGTTCACGGATAACK 

ACTCCAGGGGCCACAAGTGCCTATTGAGGAGAGGTGGTCATCACGGGGTCTCGAAGGTCC 

, ™, ^$I3 H i sLeuHi sA 1 aProThrG 1 ySerG 1 yLysSerThrLysVa 1 ProA 1 aA 1 aTyrA la 
1801 JGGCTCACCTCCATGCTCCCACAGGCAGCGGCAAAAGCACCAAGGTCC 

ACCGAGTGGAGGTACGAGGGTGTCCGTCGCCGTTTTCGTGGTTCCAGGGCCGACGTATAC 

, , „ A 1 aG 1 nG 1 yTy r LysVal LeuVa 1 LeuAsnProSe rVa 1 A 1 aA 1 aTh rLeuG 1 yPheG 1 y 
1861 CAGCTCAGGGCTATAA6GTGC™^ 

GTCGAGTCCCGATATTCCACGATCATGAGTTGGGGAGACAACGACGTTGTGACCCGAAAC 

, , ^-r^^I^^gCLysA laHlsGlyl 1 eAspProAsn 1 1 e ArgTh rG 1 y Va 1 A rgThr 1 1 e 
1921 GTGCTTACATGTCCAAGGCTCATGGGATCGATC^ 

CACGAAT6TACAGGTTCCGAGTACCCTAGCTAGGATTGTAGTCCTGGCCCCACTCTTGTT 

ThrThrGlySerProIleThrTyrSerThrTyrGlyLysPheLeuAlaAspGlyGlyCys 

1981 ummmimmMmwm^M 

CGAGCCCCCCGCGAATACTGTATTATTAAACACTGCTCACGGTGAGGTGCCTACGGTGTA 

om, rrU?t?^XU^JiXT!30§Ji LeuAs P GlnA l aG l u TlirAlaGlyAlaArgLeuValVal 
2101 CCATCTTGGGCATCGGCACTGTCCTTGACCAAGCAGAGACTGCGGGGGCGAGk 

GGTAGAACCCGTAGCCGTGACAGGAACTGGTTCGTCTCTGACGCCCCCGCTCTGACCAAC 

oicn LeuAlaThrAlaThrProProG 1 ySerVa IThrVa 1 ProH i sProAsn I leG 1 uG 1 uVa 1 
2161 TGCTCGCCACCGCCACCCCTCCGG 

ACGAGCGGTGGCGGTGGGGAGGCCCGAGGCAGTGACACGGGGTAGGGTTGTAGCTCCTCC 

^ AlaLeuSerThrThrGlyGluIleProPheTyrGlyLysAlalleProLeuGluVallle 

2221 JJGCTCTGTCCACCACCGGAGAGATCCCTTTTT^ 

AACGAGACAGGTGGTGGCCTCTCTAGGGAAAAATGCC6TTCCGATAGGGGGAGCTTCATT 

LysGlyGlyAr 
2281 TCAAGGGGGGGAG 



QHisLeuIlePheCysHisSerLysLysLysCysAspGluLeuAlaAla 
.^ ? nv»u»uuunJACATCTCATCTTCTGTCATTCAAAGAAGAAGTGCGACGAACTCGCCG 
AGTTCCCCCCCTCTGTAGAGTAGAAGACAGTAAGTTTCTTCTTCACGCTGCTTGAGCGGC 

„.„ r LysLeuValAlaLeyGlylleAsnAlaValAlaTyrT^ 

AGTAGGGCTGGTCGCCGCTACAACAGCAGCACCGTTGGCTACGGGAGTACTGGCCGATAT 

o/. c , rr G JiyAspPheAspSe0aineAspCysAsnThrCysValTh 
2461 CCGGCGACTTCGACTCGGTGATAGACTGCAATACGT^ 

GGCCGCTGAAGCTGAGCCACTATCTGACGTTATGCACACAGTGGGTCTGTCAGCTAAAGT 

„,„ r LeuAspProThrPheThrlleGluThrlleT^ 
2521 r££TT£ACCCTACCTTCACCA^ 

CGGAACTGGGATGGAAGTGGTAACTCTGTTAGTGCGAGGGGGTCCTACGACAGAGGGCGT 

oco i 5?^^^f961yArgjh rG 1 y A rgG 1 yLysProG 1 y 1 1 eTy r A rgPheVa 1 A 1 aProG 1 y 
2581 CTCAACGTCGGGGCAGGACTGGCAG^ y 
GAGTTGCAGCCCCGTCCTGACCGTCCCCCTTCGGTCCGTAGATGTCTAAACACCGTGGCC 

,,.„ rr GiuArgProSerGlyMetP^ 

2641 GGGAGCGCCCCTCCGGGATGTTCGACTCGTCCGTCCTCTGTGAGTGCTATGACGCAGGCT 
CCCTCGCGGGGAGGCCGTACAAGCTGAGCAGGCAGGAGACACTCACGATACTGCGTCCGA 

2701 GTGCTTGGTATGAGCTCACGCCCGCCGAGACTACAGTTAGGCTACGAGCGTACATGAACA 
CACGAACCATACTCGAGTGCGGGCGGCTCTGATGTCAATCCGATGCTCGCATGTACTTGT 



3181 



FIG. 32D 

ProGlyLeuProValCysGlnAspHisLeuGluPheTrpGluGlyV^^ 
2761 CCCCGGGGCTTCCCGTGTGCCAGGACCATCTTGM^ 

GGGGCCCCGAAGGGCACACGGTCCTGGTAGAACTTAAAACCCTCCCGCAGAAATGTCCGG 

ThrH i s 1 1 eAspA laH i sPheLeuSerG InThrLysG 1 nSerG lyG luAsnLeuProTy r 
2821 TCACTCATATAGATGCCCACTTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTT^ 

AGTGAGTATATCTACGGGTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAA 

LeuValAlaTyrGlnAlaThrValCysAlaArgAlaGlnAlaProProProSerTrpAsp 

2881 ACCTGGTAGCGTACCAAGCCACCGTGTGCGCTAGGG^ 

TGGACCATCGCATGGTTCGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCC 

G 1 nMetTrpLysCysLeu 1 1 eArgLeuLysP roThrLeuH i sG lyProThrProLeuLeu 

2941 ACCAGATGTGGAAGTGTTTGATT^ 

TGGTCTACACCTTCACAAACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACG 

TyrArgLeuGlyAlaValGlnAsnGluIleThrLeuThrHisProValThrLysTyrlle 
3001 TATACAGACTGGGCGCTGTTCAGAATGAAATCACCCTGACGCACCCAG^ 

ATATGTCTGACCCGCGACAAGTCTTACTTTAGTGGGACTGCGTGGGTCAGTGGTTTATGT 

MetThrCysMetSerAlaAspLeuGluValValThrSerThrTrpValLeuValGlyG^ 
3061 TCATGACATGCATGTCGGCCGACCTGGAGGTCGTCACGAGCACCTGGGT 

AGTACTGTACGTACAGCCGGCTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCAACCGC 

ValLeuAlaAlaLeuAlaAlaTyrCysLeuSerThrGlyCysValVallleValGlyArg 

3121 GCGTCCTGGCTGCTTTGGCCGCGTATT^ 

CGCAGGACCGACGAAACCGGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGT 

Va 1 Va 1 LeuSerG lyLysProA 1 a 1 1 e 1 1 eProAspA rgG 1 uVa 1 LeuTyr ArgG 1 uPhe 
GGGTCGTCTTGTCCGGGAAGCCGGCAATCATACCTGACAGGGAAGTCCTCTACCGA^ 
CCCAGCAGAACAGGCCCTTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTCTCA 

AsdG 1 uMe tG 1 uG 1 uCy sSe rG 1 nH i sLeuProTy r 1 1 eG 1 uG 1 nG 1 yMe tMetLeuA 1 a 
3241 TCGATGAGATGGAAGAG^ 

AGCTACTCTACCTTCTCACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGAGC 

GluGlnPheLysGlnLysAlaLeuGlyLeuLeuGlnThrAlaSerArgGlnAlaGl^ 
3301 CCGAGCAGTTCAAGCAGAAGGCCCTCGGCCTCCTGCAGACCGCGTCCCGTCAGGCAGAGG 
GGCTCGTCAAGTtCGTCTTCCGGGAGCCGGAGGACGTCTGGCGCAGGGCAGTCCGTCTCC 

1 1 eA 1 aP roA 1 aVa 1 G 1 nTh r AsnTrpG 1 nLysLeuG 1 uTh r PheTrpAl aLysH i sMe t 
3361 TTATCGCCCCTGCTGTCCAGACCAACTGGCAAAAACTCGAGACCTTCTG^ 

AATAGCGGGGACGACAGGTCTGGTTGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTAT 

TrpAsnPhel leSerGlyl leGlnTyrLeuAlaGlyLeuSerThrLeuProGlyAsnPro 
3421 TGTGGAAmCATCAGTGGGATACAATACTTGGCGGGCTTGTCAACGCTG^ 

ACACCTTGAAGTAGTCACCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGG 

A 1 a 1 1 e A 1 aSe rLeuMe tA 1 aPheTh rAlaAlaValThrSerProLeuTh 1 n 

3481 CCGCCATTGCTTCATTGATGGCTTTTACAGCTGCTG^ 

GGCGGTAACGAAGTAACTACCGAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGG 

ThrLeuLeuPheAsnlleLeuGlyGlyTrpValAlaAlaGlnLeuAlaAlaProGlyAla 
3541 AAACCCTCCTCTTCAACATATTGGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCC^ 

TTTGGGAGGAGAAGTTGTATAACCCCCCCACCCACCGACGGGTCGAGCGGCGGGGGCCAC 

AlaThrAlaPheValGlyAlaGlyLeuAlaGlyAlaAlal leGlySerValGlyLeuGly 

3601 ccgctactgcctWgtgggcgctggcttagctggm 

ggcgatgacggaaacacccgcgaccgaatcgaccgcggcggtagccgtcacaacctgacc 

LysValLeuIleAspIleLeuAlaGlyTyrGlyAlaGlyV^ 
3661 GGAAGGTCCTCATAGACATCCTTGCAGGGTATGGCGCGGG^ 

CCTTCCAGGAGTATCTGTAGGAACGTCCCATACCGCGCCCGCACCGCCCTCGA6AACACC 

PheLys 1 1 eMetSe rG 1 yG 1 uVa 1 ProSe rTh rG 1 uAspLeuVa 1 AsnLeuLeuProA 1 a 
3721 CATTCAAGATCATGAGCGGTGAGGTCCCCTCCACGGAGGACCTG^ 

GTAAGTTCTAGTACTCGCCACTCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGACG6GC 



FIG. 32E 

1 1 eLeuSe rProG 1 yA 1 aLeuVa 1 Va 1 G 1 y Va 1 Va 1 Cys A 1 aA 1 a 1 1 eLeuA rgA rgH i s 
3781 CCATCCTCTCGCCCG6AGCCCTCGTAGTCGGCGTGGTCTGTGCAGCAATACTGCGCCGGC 
GGTAGGAGAGCGGGCCTCGGGAGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCG 

Va 1 G 1 yP roG lyG luG 1 yA 1 aVa 1 G 1 nTrpMet Asn A rgLeu I leAl aPheA 1 aSer Arg 
3841 ACGTTGGCCCGGGCGAGGGGGCAGTGCAGTGGATGAACCGGCTGATAGCCTTCGCCTCCC 
TGCAACCGGGCCCGCTCCCCCGTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGG 

GlyAsnHisValSerProThrHisTyrValProGluSerAspAlaAlaAlaArgValThr 
3901 GGGGGAACCATGTTTCCCCCACGCACTACGTGCCGGAGAGCGATGCAGCTGCCCGCGTCA 
CCCCCTTGGTACAAAGGGGGTGCGTGATGCACGGCCTCTCGCTACGTCGACGGGCGCAGT 

AlalleLeuSerSerLeuThrValThrGlnLeuLeuArgArgLeuHisGlnTrpIleSer 
3961 CTGCCATACTCAGCAGCCTCACTGTAACCCAGCTCCTGAGGCGACTGCACCAGTGGATAA 
GACGGTATGAGTC6TCGGAGTGACATTGGGTCGAGGACTCCGCTGACGTGGTCACCTATT 

SerGluCysThrThrProCysSerGlySerTrpLeuArgAspI leTrpAspTrpI leCys 
4021 GCTCGGAGTGTACCACTCCATGCTCCGGTTCCTGGCTAAGGGACATCTGGGACTGGATAT 
CGAGCCTCACATGGTGAGGTACGAG6CCAAGGACCGATTCCCTGTAGACCCTGACCTATA 

GluValLeuSerAspPheLysThrTrpLeuLysAlaLysLeuMetProGlnLeuProGly 
4081 GCGAGGTGTTGAGCGACTTTAAGACCTGGCTAAAAGCTAAGCTCATGCCACAGCTGCCTG 
CGCTCCACAACTCGCTGAAATTCTGGACCGATTTTCGATTCGAGTACGGTGTCGACGGAC 

1 1 eProPheVa 1 SerCysG 1 nAr gG 1 yTyrLysG lyValTrpA rgVa 1 AspG 1 y 1 1 eMet 
4141 GGATCCCCTTTGTGTCCTGCCAGCGCGGGTATAAGGGGGTCTGGCGAGTGGACGGCATCA 
CCTAGGGGAAACACAGGACGGTCGCGCCCATATTCCCCCAGACCGCTCACCTGCCGTAGT 

HisThrArgCysHIsCysGlyAlaGluIleThrGIyHisValLysAsnGlyThrMetArg 
4201 TGCACACTCGCTGCCACTGTGGAGCTGAGATCACTGGACATGTCAAAAACGGGACGATGA 
ACGTGTGAGCGACGGTGACACCTCGACTCTAGTGACCTGTACAGTTTTTGCCCTGCTACT 

1 1 eVa 1 G 1 yP roA rgTh rCys ArgAsnMetTrpSerG 1 yTh rPhePro 1 1 e AsnA 1 aTy r 
4261 GGATCGTCGGTCCTAGGACCTGCAGGAACATGTGGAGTGGGACCTTCCCCATTAATGCCT 
CCTAGCAGCCAGGATCCTGGACGTCCTTGTACACCTCACCCTGGAAGGGGTAATTACGGA 

ThrThrGlyProCysThrProLeuProAlaProAsnTyrThrPheAlaLeuTrpArgVal 
4321 ACACCACGGGCCCCTGTACCCCCCTTCCTGCGCCGAACTACACGTTCGCGCTATGGAGGG 
TGTGGTGCCCGGGGACATGGGGGGAAGGACGCGGCTTGATGTGCAAGCGCGATACCTCCC 

SerAlaGluGluTyrValGluIleArgGlnValGlyAspPheHisTyrValThrGlyMet 
4381 TGTCTGCAGAGGAATAT6TGGAGATAAGGCAGGTGGGGGACTTCCACTACGTGACGGGTA 
ACAGACGTCTCCTTATACACCTCTATTCCGTCCACCCCCTGAAGGTGATGCACTGCCCAT 

ThrThrAspAsnLeuLysCysProCysGlnValProSerProGluPhePheThrGluLeu 
4441 TGACTACTGACAATCTCAAATGCCCGTGCCAGGTCCCATCGCCCGAATTTTTCACAGAAT 
ACTGATGACTGTTAGAGTTTACGGGCACGGTCCAGGGTAGCGGGCTTAAAAAGTGTCTTA 

AspGlyValArgLeuHisArgPheAlaProProCysLysProLeuLeuArgGluGluVal 
4501 TGGACGGGGTGCGCCTACATAGGTTTGCGCCCCCCTGCAAGCCCTTGCTGCGGGAGGAGG 
ACCTGCCCCACGCGGATGTATCCAAACGCGGGGGGACGTTCGGGAACGACGCCCTCCTCC 

SerPheArgValGlyLeuHisGluTyrProValGlySerGlnLeuProCysGluProGlu 
4561 TATCATTCAGAGTAGGACTCCACGAATACCCGGTAGGGTCGCAATTACCTTGCGAGCCCG 
ATAGTAAGTCTCATCCTGAGGTGCTTATGGGCCATCCCAGCGTTAATGGAACGGTCGGGC 

Pr o AspVa 1 A 1 aVa 1 LeuThrSe rMetLeuThr AspProSe r H i s 1 1 eThr A 1 aG 1 uA 1 a 
4621 AACCGGACGTGGCCGTGTTGACGTCCATGCTCACTGATCCCTCCCATATAACAGCAGAGG 
TTG6CCTGCACCGGCACAACTGCAGGTACGAGTGACTAGGGAGGGTATATTGTCGTCTCC 

AlaGlyArgArgLeuAlaArgGlySerProProSerValAlaSerSerSerAlaSerGIn 
4681 CGGCCGGGCGAAGGTTGGCGAGGGGATCACCCCCCTCTGTGGCCAGCTCCTCGGCTAGCC 
GCCGGCCCGCTTCCAACCGCTCCCCTAGTGGGGGGAGACACCGGTCGAGGAGCCGATCGG 

LeuSerAlaProSerLeuLysAlaThrCysThrAlaAsnHisAspSerProAspAlaGlu 
4741 AGCTATCCGCTCCATCTCTCAAGGCAACTTGCACCGCTAACCATGACTCCCCTGATGCTG 
TCGATAGGCGAGGTAGAGAGTTCCGTTGAACGTGGCGATTGGTACTGAGGGGACTACGAC 



FIG. 32F 

LeulleGluAlaAsnLeuLeuTrpArgGlnGluMetGlyGlyAsnlleThrArgValGlu 
801 AGCTCATAGA6GCCAACCTCCTATGGAGGCAGGAGATGGGCGGCAACATCACCAGGGTTG 
TCGAGTATCTCCGGTTGGAGGATACCTCCGTCCTCTACCCGCCGTTGTAGTGGTCCCAAC 



SerGluAsnLysValVal I leLeuAspSerPheAspProLeuValAlaGluGluAspGlu 
1861 AGTCAGAAAACAAAGTGGTGATTCTGGACTCCTTCGATCCGCTTGTGGCGGAGGAGGACG 
TCAGTCTTTTGTTTCACCACTAAGACCTGAGGAAGCTAGGCGAACACCGCCTCCTCCTGC 

, «o , . ArgG 1 u 1 1 eSe r Va 1 Pro A 1 aG 1 u 1 1 eLeu A rgLy sSe rA rgA rgPhe A 1 aG 1 nA 1 aLeu 
1921 AGCGGGAGATCTCCGTACCCGCAGAAATCCTGCGGAAGTCTCGGAGA 

TCGCCCTCTAGAGGCATGGGCGTCTTTAGGACGCCTTCAGAGCCTCTAAGCGGGTCCGGG 

^ProValTrpAlaArgProAspTyrAsnProProLeuValGluThrTrpLysLysProAsp 
1981 TGCCCGTTTGGGCGCGGCCGGACTATAACCCCCCGCTAGTGGAGACGTGGAAAAAGCCCG 
ACGGGCAAACCCGCGCCGGCCTGATATTGGGGGGCGATCACCTCTGCACCTTTTTCGGGC 

, . TyrG 1 uProProVa 1 Va lHlsGl yCysProLeuProProProLysSerProProVa 1 Pro 
5011 ACTACGAACCACCTGTGGTCCATGGCTGTCCGCTTCCACCTCCAAAGTCCCCTCCTGTGC 
TGATGCTTGGTGGACACCAGGTACCGACAGGCGAAGGTGGAGGTTTCAGGGGAGGACACG 

ProProArgLysLysArgTh^ 
5101 CTCCGCCTCGGAAGAAGCGGACGGTGGTCCTCACTGAATCAACCCTATCTACTGCCTTGG 
GAGGCGGAGCCTTCTTCGCCTGCCACCAGGAGTGACTTAGTTGGGATAGATGACGGAACC 

"LeyAlaThrArgSe r PheG 1 ySe rSerSe rTh rSerGlyl 1 eThrG 1 yAspAsnThr 
5161 CCGAGCTCGCCACCAGAAGCTT^ 

GGCTCGAGCGGTGGTCTTCGAAAtCGTCGAGGAGTTGAAGGCCGTAATGCCCGCTGTTAT 

TDrTdCSerSerGluProAlaProSerGlyCysProProAspSerAspAlaGluSerTyr 
5221 CGACAACATCCTCTGAGCCCGCCCCTTCTGGCTGCCCCCCCGACTCCGACGCTGAGTCCT 
GCTGTTGTAGGAGACTCGGGCGGGGAAGACCGACGGGGGGGCTGAGGCT6CGACTCAGGA 

, . SerSerMetProProLeuGluGlyGluProGlyAspProAspLeuSerAspGlySerTrp 

5281 ATTCCTCCATGCCCCCCCTGGAGGGGGAGCCTGGGGATCCGGATCTTAGCGACGGGTCAT 
TAAGGAGGTACGGGGGGGACCTCCCCCTCGGACCCCTAGGCCTAGAATCGCTGCCCAGTA 

, „ SerThrValSerSerGluAlaAsnAlaGluAspValValCysCysSerMetSerTyrSer 
5311 GGTCAACGGTCAGTAGTGAGGCCAAC6CGGAGGATGTCGTGTGCTGCTCAATGTCTTACT 
CCAGTTGCCAGTCATCACTCCGGTTGCGCCTCCTACAGCACACGACGAGTTACAGAATGA 

TrpThrGlyAlaLeuValThrProCysAlaAlaGluGluGlnLysLeuProIleAsnAla 
5101 CTTGGACAGGCGCACTCGTCACCCCGTGCGCCGCGGAAGAACAGAAACTGCCCATCAATG 
GAACCTGTCCGCGTGAGCAGTGGGGCACGCGGCGCCTTCTTGTCTTTGACGGGTAGTTAC 

o . Uy§§r A § nSe i: LeuLeuA rgH i sH i sAsnLeuVa 1 TyrSerThrThrSerArgSerA 1 a 
5161 CACTAAGCAACTCGTTGCTACGTCACCACAATTTGGTGTATTCCACCACCTCACGCAGTG 
GTGATTCGTTGAGCAACGATGCAGTGGTGTTAAACCACATAAGGTGGTGGAGTGCGTCAC 

^ , CysG 1 nArgG 1 nLysLys Va 1 Th rPheAspA rgLeuG 1 nVa 1 LeuAspSerH i sTyrG 1 n 
5521 CTTGCCAAAGGCAGAAGAAAGTCACATTTGACAGACTGCAAGTTCTGGACAGCCATTACC 
GAACGGTTTCCGTCTTCTTTCAGTGTAAACTGTCTGACGTTCAAGACCTGTCG6TAATGG 

, . ^AspVal LeuLysG 1 uVa 1 LysA 1 a A 1 a A 1 aSe rLysVa 1 Lys A 1 aAsnLeuLeuSerVa 1 
5581 AGGACGTACTCAAGGAGGTTAAAGCAGCGGCGTCAAAAGTGAAGGCTAACTTGCTATCCG 
TCCTGCATGAGTTCGTCCAATTTCGTCGCCGCAGTTTTCACTTCCGATTGAACGATAGGC 

rr., -r . § 1 y§ 1 u A 1 aCysSe rLeuThrProProH 1 sSe r A 1 a LysSe r Ly sPheG 1 yTy rG 1 yA 1 a 
5611 TAGAGGAAGCTTGCAGCCTGACGCCCCCACACTCAGCCAAATCCAAGTTTGGTTATGGGG 
ATCTCCTTC6AACGTCGGACTGCGGGGGTGTGAGTCGGTTTAGGTTCAAACCAATACCCC 

LysAspVa 1 A rgCy sH i sA 1 aA rgLys A I aVa 1 ThrH i s 1 1 eAsnSerVa 1 TrpLysAsp 
5701 CAAAAGACGTCCGTTGCCATGCCAGAAAGGCCGTAACCCACATCAACTCCGTGTGGAAAG 
GTTTTCTGCAGGCAACGGTACGGTCTTTCCGGCATTGGGTGTAGTTGAGGCACACCTTTC 

LeuLeuGluAspAsnValThrProIleAspThrThrlleMetAlaLysAsnGluValPhe 
5761 ACCTTCTGGAAGACAATGTAACACCAATAGACACTACCATCATGGCTAAGAACGAGGTTT 
TGGAAGACCTTCTGTTACATTGTGGTTATCTGTGATGGTAGTACCGATTCTTGCTCCAAA 



FIG. 32G 



CysValGlnProGluLysGlyGlyArgLysProAlaArgLeuIleValPheProAspLeu 
5821 TCTGCGTTCAGCCTGAGAAGGGGGGTCGTAAGCCAGCTCGTCTCATCGTGTTCCCCGATC 
- AGACGCAAGTCGGACTCTTCCCCCCAGCATTCGGTCGAGCAGAGTAGCACAAGGGGCTAG 

GlyValArgValCysGluLysMetAlaLeuTyrAspValValThrLysLeuProLeuAla 
5881 TGGGCGTGCGCGTGTGCGAAAAGATGGCTTTGTACGACGTGGTTACAAAGCTCCCCTTGG 
ACCCGCACGCGCACACGCTTTTCTACCGAAACATGCTGCACCAATGTTTCGAGGGGAACC 

ValMetGlySerSerTyrGlyPheGlnTyrSerProGlyGlnArgValGluPheLeuVal 
5941 CCGTGATGGGAAGCTCCTACGGATTCCAATACTCACCAGGACAGCGGGTTGAATTCCTCG 
GGCACTACCCTTCGAGGATGCCTAAGGTTATGAGTGGTCCTGTCGCCCAACTTAAGGAGC 

GlnAlaTrpLysSerLysLysThrProMetGlyPheSerTyrAspThrArgCysPheAsp 
6001 TGCAAGCGTGGAAGTCCAAGAAAACCCCAATGGGGTTCTCGTATGATACCCGCTGCTTTG 
ACGTTCGCACCTTCAGGTTCTTTTGGGGTTACCCCAAGAGCATACTATGGGCGACGAAAC 

Se rTh rVa 1 ThrG 1 uSe rAs p 1 1 e A rgTh rG 1 uG 1 uA 1 a 1 1 eTy rG 1 nCy sCys AspLeu 
6061 ACTCCACAGTCACTGAGAGCGACATCCGTACGGAGGAGGCAATCTACCAATGTTGTGACC 
TGAGGTGTCAGTGACTCTCGCTGTAGGCATGCCTCCTCCGTTAGATGGTTACAACACTGG 

AspP roG 1 nA 1 aA rgVa 1 A 1 a 1 1 eLysSe r LueThrG 1 uA rgLeuTy rVa 1 G lyG 1 yPro 
6121 TCGACCCCCAAGCCCGCGTGGCCATCAAGTCCCTCACCGAGAGGCTTTATGTTGGGGGCC 
AGCTGGGGGTTCGGGCGCACCGGTAGTTCAGGGAGTGGCTCTCCGAAATACAACCCCCGG 

LeuThrAsnSer ArgG 1 yG 1 uAsnCysG 1 yTy rArgA rgCysA rgA 1 aSerG 1 y Va 1 Leu 
6181 CTCTTACCAATTCAAGGGGG6AGAACTGCGGCTATCGCAGGTGCCGCGCGAGCGGCGTAC 
GAGAATGGTTAAGTTCCCCCCTCTTGACGCCGATAGCGTCCACGGCGCGCTCGCC6CATG 

ThrThrSerCysGlyAsnThrLeuThrCysTyrlleLysAlaArgAlaAlaCysArgAla 
6241 TGACAACTAGCTGTGGTAACACCCTCACTTGCTACATCAAGGCCCGGGCAGCCTGTC6AG 
ACTGTTGATCGACACCATTGTGGGAGTGAACGATGTAGTTCCGGGCCCGTCGGACAGCTC 

A 1 aG 1 yLe uG 1 n AspCy sTh rMe tLeuVa 1 CysG 1 yAspAspLeuVa 1 Va 1 1 1 eCy sG 1 u 
6301 CCGCAGGGCTCCAGGACTGCACCATGCTCGTGTGTGGCGACGACTTAGTCGTTATCTGTG 
GGCGTCCCGAGGTCCTGACGTGGTACGAGCACACACCGCTGCTGAATCAGCAATAGACAC 

SerA 1 aG 1 y Va 1 G 1 nG 1 uAspA 1 aA 1 aSe r LeuA rgA 1 aPheTh rG 1 uA 1 aMetTh r A rg 
6361 AAAGCGCGGGGGTCCAGGAGGACGCG6CGAGCCTGAGAGCCTTCACGGAGGCTATGACCA 
TTTCGCGCCCCCAGGTCCTCCTGCGCCGCTCGGACTCTCGGAAGTGCCTCCGATACTGGT 

TyrSerAlaProProGlyAspProProGlnProGluTyrAspLeuGluLeuIleThrSer 
6421 GGTACTCCGCCCCCCCTGGGGACCCCCCACAACCAGAATACGACTTGGAGCTCATAACAT 
CCATGAGGCGGGGGGGACCCCTGGGGGGTGTTGGTCTTATGCTGAACCTCGAGTATTGTA 

CysSe rSe r Asn Va 1 Se rVa 1 A 1 aH i s As pG 1 yA 1 aG 1 y Ly s A rgVa 1 Ty rTy r Le uTh r 
6481 CATGCTCCTCCAAC6TGTCAGTCGCCCACGACGGCGCTGGAAAGAGGGTCTACTACCTCA 
GTACGAGGAGGTTGCACAGTCAGCGGGTGCTGCCGCGACCTTTCTCCCAGATGATGGAGT 

ArgAspProTh rTh rProLeuA 1 aA rgA 1 aA 1 aTr pG 1 uThr A 1 aA rgH i sTh r ProVa 1 
6541 CCCGTGACCCTACAACCCCCCTCGCGAGAGCTGCGTGGGAGACAGCAAGACACACTCCAG 
GGGCACTGGGATGTTGGGGGGAGCGCTCTCGACGCACCCTCTGTCGTTCTGTGTGAGGTC 

AsnSerTrpLeuGlyAsnllelleMetPheAlaProThrLeuTrpAlaArgMetlleLeu 
6601 TCAATTCCTGGCTAGGCAACATAATCATGTTTGCCCCCACACTGTGGGCGAGGATGATAC 
AGTTAAGGACCGATCCGTTGTATTAGTACAAACGGGG6TGTGACACCCGCTCCTACTATG 

MetTh rH i SPhePheSe rVa 1 Leu 1 1 eA 1 aArgAspG 1 nLeuG 1 uG 1 nA 1 aLeuAspCys 
6661 TGATGACCCATTTCTTTAGCGTCCTTATAGCCAGGGACCAGCTTGAACAGGCCCTCGATT 
ACTACTGGGTAAAGAAATCGCAGGAATATCGGTCCCTGGTCGAACTTGTCCGGGAGCTAA 

GluIleTyrGlyAlaCysTyrSerlleGluProLeuAspLeuProProIlelleGlnArg 
6721 GCGAGATCTACGGGGCCTGCTACTCCATAGAACCACTTGATCTACCTCCAATCATTCAAA 
CGCTCTAGATGCCCCGGACGATGAGGTATCTTGGTGAACTAGATGGA6GTTAGTAAGTTT 

Leu 

6781 GACTC 
CTGAG 



1» I 1 188 g 

J? 
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Chimp 
Lane Reference 
Number Number 



Infection 
Type 



Sample 
date (days) 
(O=lnoculation 
day) 



ALT (alanine) 
aminotransferase 
level in sera)mu/ml ) 



1 
2 
3 
4 

5 
6 
7 
8 

9 
10 
11 
12 

13 
14 
15 
16 

17 
18 
19 
20 

21 
22 
23 
24 

26 
27 
28 
29 

30 
31 
32 
33 

34 
35 
36 

37 
38 
39 
40 

41 
12 
13 
11 



1 
1 
1 
1 

2 
2 
2 
2 

3 
3 
3 
3 

1 
1 
1 

5 
5 
5 
5 

6 
6 
6 
6 

7 
7 
7 
7 

8 
8 
8 
8 

9 
9 
9 

10 
10 
10 
10 

11 
11 
11 
11 



NANB 
NANB 
NANB 
NANB 

NANB 
NANB 
NANB 
NANB 

NANB 
NANB 
NANB 
NANB 

NANB 
NANB 
NANB 
NANB 

HAV 
HAV 
HAV 
HAV 

HAV 
HAV 
HAV 
HAV 

HAV 
HAV 
HAV 
HAV 

HAV 
HAV 
HAV 
HAV 

HBV 
HBV 
HBV 

HBV 
HBV 
HBV 
HBV 

HBV 
HBV 
HBV 
HBV 



0 
76 
118 
154 

0 
21 
73 
138 

0 
13 
53 
159 

-3 
55 
83 
140 

0 
25 
40 
268 

-8 
15 
41 
129 

0 
22 
115 
139 

0 
26 
74 
205 

-290 
379 
435 

0 

111-118 

205 

240 



(pool) 



0 
28- 
169 
223 



56 (pool) 



0 
71 
19 
N/A 

0 
52 
13 
N/A 

8 

205 
14 
6 

11 
132 
N/A 
N/A 

4 

147 
18 

5 

N/A 
100 
10 
N/A 

7 
83 
5 

N/A 

15 
130 
8 
5 

N/A 
9 
6 

8 

96-156 (pool) 
9 
13 

11 

8-100 (pool) 
9 

10 
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Patient 






Lane 


Reference 




ALT Level (mju/ml) 


Number 

hi Villi 1 1 & 


^% mm/ ^ 


Diagnosis 


1 


ll 


NANB 


1354 


2 


ll 


NANB 


31 


3 


2* 


NANB 


14 


4 


2l 


NANB 


79 


5 


2l 


NANB 


26 


6 


3* 


NANB 


78 


7 


3 1 


NANB 


87 


8 




NANB 


25 


9 


Jl 


NANB 


60 


10 


41 


NANB 


13 


11 


5I 


NANB 


298 


12 


i\ 


NANB 


101 


13 


ll 


NANB 


474 


14 




NANB 


318 


15 


7I 


NANB 


20 


16 


7I 


NANB 


163 


17 


8l 


NANB 


44 


18 


8l 


NANB 


50 


19 


9 


NANB 


N/A 


20 


10 


NANB 


N/A 


21 


11 


NANB 


N/A 


22 


12 


Normal 


N/A 


23 


13 


Normal 


N/A 


24 


14 


Normal 


N/A 


26 


30174 


Normal 


N/A 


27 


30105 


Normal 


N/A 


28 


30072 


Normal 


N/A 


29 


30026 


XT — . _i ... — 1 

Normal 


M / A 
Si/ A 


30 


3U14o 


in 0 xr max 


N /A 


31 


1 AO C A 

3UZ5U 


normal 




32 


3UC , 1 


Normal 


N /A 

M / A 


33 


1 c 
13 




N/A 
W / A 


34 


10 


ACUtfinAV 


N /A 

W / A 


35 


1 / 


ACuLcnAV 


N /A 

W / A 


3o 


1 S 

lo 


ACUtctUiv 


N/A 

1H / A 


^7 
j / 


*t v woo 


AcuteHAV 


N/A 


38 


47288 


AcuteHAV 


N/A 


39 


47050 


AcuteHAV 


N/A 


40 


46997 


AcuteHAV 


N/A 


41 


19 


Convalescent HBV 


N/A 


42 


20 


( anti-HBSag+ve ; 


N/A 


43 


21 


anti-HBCag+ve) 


N/A 


44 


22 


(anti-HBSag+ve ; 


N/A 


45 


23 


anti-HBCag+ve) 


N/A 


46 


24 


(anti-HBSag+ve ; 


N/A 


47 


25 


ant i -HBCag+ve ) 


N/A 


48 


26 


(anti-HBSag+ve ; 


H/A 


49 


27 


anti-HBSag+ve) 


N/A 



is equential 



serum samples were assayed from these patients 




FIG. 35 
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FIG. 36A 

SOD COOM [--adaptor ] INANBHpolypeptide> 

AlaCysGlyVal I leGlyl leAlaGlnAsnLeuGlyl leArgAspAlaHisPheLeuSer 
1 GCTTGTGGTGTAATTGGGATCGCCCAGAATTTGGGAATTCG6GATGCCCACTTTCTATCC 
CGAACACCACATTAACCCTAGCGGGTCTTAAACCCTTAAGCCCTACGGGTGAAAGATAGG 

»>»»»>>»>»>»>> 

G 1 nTh rLysG 1 nSe rG 1 yG 1 uAsnLeuProTy rLeuVa 1 A 1 aTy rG 1 nA 1 aTh rVa 1 Cys 
61 CAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACCAAGCCACCGTGTGC 
GTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGCATGGTTCGGTGGCACACG 

AlaArgAlaGlnAlaProProProSerTrpAspGlnMetTrpLysCysLeuI leArgLeu 
121 GCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGTGTTTGATTCGCCTC 
CGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCACAAACTAAGCGGAG 

LysProThrLeuHisGlyProThrProLeuLeuTyrArgLeuGlyAlaValGlnAsnGlu 
181 AAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTGGGCGCTGTTCAGAATGAA 
TTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGACCCGCGACAAGTCTTACTT 

HeThrLeuThrHisProValThrLysTyrlleMetThrCysMetSerAlaAspLeuGlu 
241 ATCACCCTGACGCACCCAGTCACCAAATACATCATGACATGCATGTCGGCCGACCTGGAG 
TAGTGGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACGTACAGCCGGCTGGACCTC 

ValValThrSerThrTrpValLeuValGlyGlyValLeuAlaAlaLeuAlaAlaTyrCys 
301 GTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTTTGGCCGCGTATTGC 
CAGCAGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGACGAAACCGGGGCATAACG 

LeuSerThrGlyCysValVallleValGlyArgValValLeuSerGlyLysProAlalle 
361 CTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTGTCCGGGAAGCCGGCAATC 
GACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAACAGGCCCTTCGGCCGTTAG 

1 1 eProAspA rgG 1 uVa 1 LeuTy r A rgG 1 uPheAspG 1 uMetG 1 uG 1 uCysSe rG 1 nH i s 
121 ATACCTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGATGGAAGAGTGCTCTCAGCAC 
TATGGACTGTCCCTTCAGGAGATGGCTCTCAA6CTACTCTACCTTCTCACGAGAGTCGTG 

LeuProTyrlleGluGlnGlyMetMetLeuAlaGluGlnPheLysGlnLysAlaLeuGly 
481 TTACCGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAAGCAGAAGGCCCTCGGC 
AATGGCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAGTTCGTCTTCCGGGAGCCG 

LeuLeuG 1 nTh r A 1 aSe r ArgG InAlaGluVallleAl aProA 1 aVa 1G 1 nTh r AsnTr p 
5H1 CTCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTGTCCAGACCAACTGG 
GAGGACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGACGACAGGTCTGGTTGACC 

G 1 nLysLeuGl uThrPheTrpA 1 aLysH i sMetTrpAsnPhe 1 1 eSerG 1 y 1 1 eG 1 nTy r 
601 CAAAAACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCAGTGGGATACAATAC 
GTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGTCACCCTATGTTATG 

LeuAlaGlyLeuSerThrLeuProGlyAsnProAlalleAlaSerLeuMetAlaPheThr 
661 TTGGCGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGCTTCATTGATGGCTTTTACA 
AACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTAACTACCGAAAATGT 

AlaAlaValThrSerProLeuThrThrSerGlnThrLeuLeuPheAsnlleLeuGlyGly 
721 GCTGCTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCTTCAACATATTGGGGGGG 
CGACGACAGTGGTCGGGTGATTGGTGATCGGTTTGGGAGGAGAAGTTGTATAACCCCCCC 

TrpValAlaAlaGinLeuAlaAlaProGlyAlaAlaThrAlaPheValGlyAlaGlyLeu 
781 TGGGTG6CTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGTGGGCGCTGGCTTA 
ACCCACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAACACCCGCGACCGAAT 




FIG. 36B 

AlaGlyAlaAlalleGlySerValGlyLeuGlyLysValLeuIleAspIleLeuAlaGly 
841 GCTGGCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAGACATCCTTGCAGGG 
CGACCGCGGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATCTGTAGGAACGTCCC 

Ty rG 1 yA i aG 1 y Va 1 A 1 aG 1 y A 1 aLeuVa 1 A 1 aPheLy s 1 1 eMe tSe rG 1 yG 1 u Va 1 P ro 
901 TATGGCGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAGATCATGAGCGGTGAGGTCCCC 
ATACCGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACTCGCCACTCCAGGGG 

SerTh rG 1 uAspLeuVa 1 AsnLeuLeuProA 1 a 1 1 eLeuSerProG 1 yA 1 aLeuVa 1 Va 1 
961 TCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCGGAGCCCTCGTAGTC 
AGGTGCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGCCTCGGGAGCATCAG 

GlyValValCysAlaAlalleLeuArgArgHisValGlyProGlyGluGlyAlaValGln 
1021 6GCGTGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCGAGGGGGCAGTGCA6 
CCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGCTCCCCCGTCACGTC 

<<<<<<<<<<<<<<<<< <<<NANBH] I — extra 
TrpMetAsnArgLeuIleAlaPheAlaSerArgGlyAsnHisValSerProValHisHis 
1081 TGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCCAGTCCATCAT 
ACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGGTCAGGTAGTA 

] 

LysArgOP 

1111 AAGCGTTGACGCTCCCTACGGGTGGACTGTGGAGAGACAGGGCACTGCTAAGGCCCAAAT 
TTCGCAACTGCGAGGGATGCCCACCTGACACCTCTCTGTCCCGTGACGATTCCGGGTTTA 

1201 CTCAGCCATGCATCGAGGGGTACAATCCGTATGGCCAACAACTAGCGCGTACGTAAAGTC 
GAGTCGGTACGTAGCTCCCCATGTTAGGCATACCGGTTGTTGATCGCGCATGCATTTCAG 

1261 TCCTTTCTCGATGGTCCATACCTTAGATGCGTTAGCATTAATCCGAATTC 
AGGAAAGAGCTACCAGGTATGGAATCTACGCAATCGTAATTAGGCTTAAG 



C4 




O 



FIG. 37A 

1 2 3 FIG. 37B 




FIG. 42A 

10 20 30 40 50 

HCV EYVVLLFLLLADARVCSCLWMMLL I SQAEAALENLVI LNAASLA6THGLVSFLVFFCFA 

MNWVD1 AVSFVTLIT6NMSFRDL6RVMVMVGATMTDDIGMGVTYLALLAAFKVRPTFAAGLLLRKL 
130 140 150 160 170 180 

60 70 80 90 100 110 

HCV WYLKGKWVPGAVYTFYGMWPLLLLLLALPQRAYALDTEVAASCGGVVLVGLMALTLSPYY 

MNWVD1 TSKELMMTTI G I VLLSQSTI PETI LELTDALALGMMVLKMVRKMEKYQLAVTI MA I LCVP 
190 200 210 220 230 240 

120 130 140 150 160 170 

HCV KRYISWCLWWLQYFLTRVEAQLHVWIPPLNVRGGRDAVILLMCAVHPTLVFDITKLLLAV 

MNWVD1 NAVILQNAWKVSCTILAVVSVSPLFLTSSQQ^ 

250 260 270 280 290 

180 190 200 210 220 230 

HCV FGPLWILQASLLKVPYF-VRVQGLLRF-CALARKMIGGHYVQMVI IKLGALTGTYVYNHL 

■ » ■ • • • ■ 

MNWVD1 KKRSWPLNEAiMAVGMVSILASSLLKNDIPMTC 

300 310 320 330 340 350 

240 250 260 270 280 290 

HCV TPLRDWAHNGLRDLAVAVEPWFSQMETKLITWGADTAACGDI INGLPVSARRGREILLG 

MNWVD1 ADVK-WEDQAE I SGSSP i LSiti SE-DG S MS i KNEEEEQTLT I L I RTGLLVISG — LFP 
360 370 380 390 400 410 

300 310 320 330 340 350 

HCV PADGMVSKGWRLLAPITAYAQQTRGLLGCI ITSLTGRDKNQVEGEVQIVSTAAQTFLATC 

MNWVD1 VS I PI TA AAWYLWEVKKQRAGVLWDVPSPPPVGKAELEDGA YR I KQKG i LGYS^l i GAGVY 
420 430 440 450 460 470 

360 370 380 390 400 410 

HCV I NGVCWTVYHGAGTRT I ASPKGPV I QMYTN VDQDLV GWPAPQGSRSLTPCTCGSSD 

MNWVD1 KEGTFHtMWHVtRGAVLMHKGKRiEPSWADVKKDLVSCGGGWKLEGEW 

480 490 500 510 520 530 

420 430 440 450 460 470 

HCV LYLVTRHADVIPVRRRGDSRGSLLSPRPISYLKGSSGGPLLCPAGHAVGIFRAAVCTRGV 

• • •••• ••• 

MNWVD1 PGKNPRAVQTKPGLFKtN--AGTiGAVSLD^ 

540 550 560 570 580 590 



JAM 2 



7 Wfi 



FIG. 42B 

480 490 500 510 520 530 

HCV AKAVDFIPVENLETTMRSPVFTDNSSPPVVPQSFQVAHLHAPTGSGKS — TKVPAAYAAQ 

MNWVD1 AYVSA \ AQTEK — S I EDNPE i EDD I FRK — RKLTIMDLHPG A6l<tl<:RYLPA I VRGA I KR 
600 610 620 630 640 

540 550 560 570 580 

HCV GYKVLVLNPS--VAATLGFGAYMSKAHGIDPNIRTGVRTITTGSPITYSTYGKFLADGGC 

MNWVD1 GLRTLi LAPtRVVAAEMEEALRGLPI RYOtpAi RAEHTGREI VDLMCHAtFtMRtL-SPV 
650 660 670 680 690 700 

590 600 610 620 630 640 

HCV SGGAYDI 1 1 CDECHSTDATS I LG I GTVLDQAETAGARLVVLATATPPGS VTVPHPN I EEV 

MNWVD1 RVPNYNL i iMDEAHFtDPAS I AARGY I STRVE-MGEAAG i FMtAtPPGSRD-PFPQSNAP 
710 720 730 740 750 760 

650 660 670 680 690 700 

HCV ALSTTGEIPFYGKAIPLEVIKGGRHLIFCHSKKKCDELAAKLVALGINAVAYYRGLDVSV 

MNWVD1 IMDEEREIPERSWSSGHEWVTDFKGKTVWFVPSIKAGNDTAACLRKNGKKVTQLSRKTFD 
770 780 790 « n 0 810 820 

710 720 730 740 750 760 

HCV IPTSGDVVVVATDALMTGYTGDFDSVIDCNTCVTQTVDFSLDPTFTIETITLPQDAVSRT 

MNWVD1 SEYVKTRTNDWNFVVTTDISEM6ANFKAERVIDPRRCMKPVILTDGEERVILAGPMPVTH 
830 840 850 860 870 880 

770 780 790 800 810 820 

HCV QRRGRTGRGKPGIYRFVAPGERPSGMFDSSVLCECYDAGCAWYELTPAETTVRLRAYMNT 

MNWVD1 SS 
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[=□ MoAB = 1056 
■■ Poly = 1077 
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FIG. 45 

Name Common Sequence 



Variable Sequence 



5'-3-l 


AAGCTTGATCGAATTC 


CGATCTTGC 


-2 




CGATCCTGC 


-3 




djAlt-Al bL 


-4 




CCjATCLjI GC 


-5 




r*r* tv » /rnrrv^o 
CGAAGTTGC 


-6 




CGAAGCTGC 


-7 




AGATCTTGC 


-8 




AGATCCTGC 


-9 




AGATCATGC 


-10 




AGATCGTGC 


-11 




AGAAGTTGC 


-12 




AGAAGCTGC 


-13 




CGATCTTGT 


-14 




CGATCCTGT 


-15 




TV mp TV rprrp 


-16 




tv T»fTTp/Tp 
ClaA 1 UVj 1 Vj 1 


-17 




71 TV Orprnr^rp 
LbAAb 1 1 lj 1 


-18 






- 19 




iv tv mrqiqirai 
Avx A ILI 1 Vj I 


-20 




AGATCCTGT 


-21 




AGATCATGT 


-22 




AGATCGTGT 


-23 




AGAAGTTGT 


-24 




AGAAGCTGT 


-25 




CGCTCTTGC 


-26 




CGCTCCTGC 


-27 




CGCTCATGC 


-28 




CGCTCGTGC 


-29 




CGCAGTTGC 


-30 




CGCAGCTGC 


-31 




CGCTCTTGT 


-32 




CGCTCCTGT 


-33 




CGCTCATGT 


-34 




CGCTCGTGT 


-35 




CGCAGTTGT 


-36 




CGCAGCTGT 
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FIG. 47 A 

1 



- G 1* £>?sPr oG 1 uArgLeuA 1 aSe rCysArgProLeuTh rAspPheAspGlnGlyTrpGly 
CAGGCTGTCCTGAGAGGCTAGCCAGCTGCCGACCCCTTACCGATTTTGACCAGGGCTGGG 
GTCCGACAGGACTCTCCGATCGGTCGACGGCTGGGGAATGGCTAAAACTGGTCCCGACCC 

ri ^ProIleSerTyrAlaAsnGlySerGlyProAspGlnArgProTyrCysTrpHisTyrPro 
61 GCCCTATCAGTTATGCCAACGGAAGCGGCCCCGACCAGCGCCCCTACTGCTGGCACTACC 
CGGGATAGTCAATACGGTTGCCTTCGCCGGGGCTGGTCGCGGGGATGACGACCGTGATGG 

GGGGTTTTGGAACGCCATAACACGGGCGCTTCTCACACACACCAGGCCATATAACGAA6T 

, 0 -, r-£t2&t ECSXl lVa l Va 1 G 1 yTh rTh rA spA rgSe r G 1 yA 1 a P roTh rTy rSe r T r pG 1 y 
181 CTCCCAGCCCCGTGGTGGTGGGAACGACCGACAGGTCGG6CGC6CCCACCTACAGCTGGG 
GAGGGTCGGGGCACCACCACCCTTGCTGGCTGTCCAGCCCGCGCGGGTGGATGTCGACCC 

o , , 1 uAsnAspThr AspVa 1 PheVa 1 LeuAsnAsnThrA rgProProLeuG 1 yAsnTrpPhe 
241 GTGAAAATGATACGGACGTCTTCGTCCTTAACAATACCAGGCCACCGCTGGGCAATTGGT 
CACTTTTACTATGCCTGCAGAAGCAGGAATTGTTATGGTCCGGTGGCGACCCGTTAACCA 

30! TC§IMI!EM^ 

AGCCAACATGGACCTACTTGAGTTGACCTAAGTGGTTTCACACGCCTCGCGGAGGAACAC 

^HeGlyGlyAlaGlyAsnAsnThrLeuHisCysProThrAspCysPheArgLysHisPro 
361 TCATCGGAGGGGCGGGCAACAACACCCTGCACTGCCCCACTGATTGCTTCCGCAAGCATC 
AGTAGCCTCCCCGCCCGTTGTTGTGGGACGTGACGGGGTGACTAACGAAGGCGTTCGTAG 

AspAJ^ThrTyrSerArgCysGlySerGlyPro^ 
121 CGGACGCCACATACTCTCGGTGCGGCTCCGGTCCCTGGATCACACCCAGGTGCCTGGTCG 
GCCTGCGGTGTATGAGAGCCACGCCGAGGCCAGGGACCTAGTGTGGGTCCACGGACCAGC 

, o, . TyrProTyrArgLeuTrpHisTyrProCysThrl leAsnTyrThrl lePheLysI leArg 
481 ACTACCCGTATAGGCTTTGGCATTATCCTTGTACCATCAACTACACCATATTTAAAATCA 
TGATGGGCATATCCGAAACCGTAATAGGAACATGGTAGTTGATGTGGTATAAATTTTAGT 

, „ MetTyrValGlyGlyValGluHisArgLeuGluAlaAlaCysAsnTrpThrArgGlyGlu 
541 GGATGTACGTGGGAGGGGTCGAACACA6GCTGGAAGCTGCCTGCAACTGGACGCGGGGCG 
CCTACATGCACCCTCCCCAGCTTGTGTCCGACCTTCGACGGACGTTGACCTGCGCCCCGC 

™, . ArgCysAspLeuG 1 uAspArgAspArgSerG 1 uLeuSerProLeuLeuLeuThrThrThr 
601 AACGTTGCGATCTGGAAGACAGGGACAGGTCCGAGCTCAGCCCGTTACTGCTGACCACTA 
TTGCAACGCTAGACCTTCT6TCCCTGTCCA6GCTC6AGTCGGGCAATGACGACT6GTGAT 

_ „ GInTrpGlnValLeuProCysSerPheThrThrLeuProAlaLeuSerThrGlyLeuI le 
661 CACAGTGGCAGGTCCTCCCGTGTTCCTTCACAACCCTACCAGCCTTGTCCACCGGCCTCA 
GTGTCACCGTCCAGGAGGGCACAAGGAAGTGTTGGGATGGTCGGAACAGGTGGCCGGAGT 

HisLeuHisGlnAsnlleValAspValGlnTyrLeuTyrGlyValGlySerSerlleAla 
721 TCCACCTCCACCAGAACATTGTGGACGTGCAGTACTTGTACGGGGTGGGGTCAAGCATCG 
AGGTGGAGGTGGTCTTGTAACACCTGCACGTCATGAACATGCCCCACCCCAGTTCGTAGC 

™ , ^ SerTrpAla 1 1 eLy sTr pG 1 uTy rVa 1 Va lLeuLeuPheLeuLeuLeuA 1 a AspA 1 aA rg 
781 CGTCCTGGGCCATTAAGTGGGAGTACGTCGTTCTCCTGTTCCTTCTGCTTGCAGACGCGC 
GCAGGACCCGGTAATTCACCCTCATGCAGCAAGAGGACAAGGAAGACGAACGTCTGCGCG 

0 , , V§ J CysSe rCysLeuTrpMe tMetLeuLeuI 1 eSerGl nA 1 aG 1 uA 1 aA 1 aLeuG 1 uAsn 
841 GCGTCTGCTCCTGCTTGTGGATGATGCTACTCATATCCCAAGCGGAG6CGGCTTTGGAGA 
CGCAGACGAGGACGAACACCTACTACGATGAGTATAGGGTTCGCCTCCGCCGAAACCTCT 

^LeuVallleLeuAsnAlaAlaSerLeuAlaGlyThrHisGlyLeuValSerPheLeuVal 
901 ACCTCGTAATACTTAATGCAGCATCCCTGGCCGGGACGCACGGTCTTGTATCCTTCCTCG 
TGGAGCATTATGAATTACGTCGTAGGGACCGGCCCTGCGTGCCAGAACATAGGAAGGAGC 




FIG. 38 
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N.T. = EXPRESSION NOT TESTED 
t THS POLYPEPTIDE WAS NEGATIVE IN THIS 
COLONY SCREEN BUT POSITIVE BY WESTERN 
BLOT ANALYSIS 
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FIG. 66A 



R T 

MSTNPKPQKKNKRNTNRRPQDVKFP666QIV6GVYLLPRRGPRLGVRATR 
KTSERSQPRGRRQP1PKARRPEGRTWAQPGYPWPLYGNEGCGWAGWLLSP-100 
RGSRPSWGPTDPRRRSRNLGKV I DTLTCGFADLMGY I PLVGAPLGGAARA 

T 

LAHGVRVLEDGVNYATGNLPGCSFS I FLLALLSCLTVPASAYQVRNSTGL-200 
YH VTNDCPNSS I VYEAAD A I LHTPGCVPCVREGNASRCWVAMTPTVATRD 
GKLPATQLRRH IDLLVGSATLCSALYVGDLCGSVFLVGQLFTFSPRRHWT-300 

V 

TQGCNCS I YPGHI TGHRMAWDIWINWSPTTAL VMAQLLR I PQA ILDMIAG 

AHWGVLAGIAYFSIWGNWAKVLVVLLLFAGVDAETHVTGGSAGHTVSGFV-400 

SLLAPGAKQNVQL I NTNGSWHLNSTALNCNDSLNTGWLAGLFYHHKFNSS 

GCPERLASCRPLTDFDQGWGPISYANGSGPDQRPYCWHYPPKPCGIVPAK-500 

SVCGPVYCFTPSPVVVGTTDRSGAPTYSWGENDTDVFVLNNTRPPLGNWF 

GCTWMNSTGFTKVCGAPPCVIGGAGNNTLHCPTDCFRKHPDATYSRCGSG-600 

PWLTPRCLVDYPYRLWHYPCT INYTIFKI RMYVGGVEHRLEA ACNWTRGE 
RCDLEDRDRSELSPLLLTTTQWQVLPCSFTTLPALSTGLIHLHQNIVpVQ-700 
YLYGVGSS I ASWA I KWEYVVLLFLLLADARVCSCLWMMLL I SQAEAALEN ^ 
LVILNAASLAGTHGLVSFLVFFCFAWYLKGKWVPGAVYTFYGMWPLLLLL-800 

(N) 

LALPQRAYALDTEVAASCGGVVLVGLMALTLSPYYKRY I SWCLWWLQYFL nnn 
TRVEAQLHVW I PPLNVRGGRDAV I LLMCA VHPTLVFD I TKLLL AVFGPLN-900 
I LQASLLKVPYFVRVQGLLRFCALARKMI GGHYVQIWI I KLGALTGTYVY 
NHLTPLRDWAHNGLRDLAVAVEFVVFSQMETKLITWGADTAACGDI INGL-IOOO 

PVSARRGREILLGPADGMVSKGWRLLAPITAYAQQTRGLLGCI ITSLTGR 

DKNQVEGEVQ I VSTAAQTTF ATC I NGVCWTVYHG AGTRT I ASPKGPV I QM- 1 100 
YTNVDQDLVGWPAPQGSRSLTPCTCGSSDLYLVTRHADV I PVRRRGDSRG nnn 
SLLSPRPISYLKGSSGGPLLCPAGHAVGIFRAAVCTRGVAKAVDFIPVEN-1200 
LETTMRSPVFTDNSSPPVVPQSFQVAHLHAPTGSGKSTKVPAAYAAQGYK 

VLVLNPSVAATLGFGAYMSKAHG IDPNI RTGVRT I TTGSP I TYSTYGKFL- 1300 

ADGGCSGGAYDI 1 1 CDECHSTDATS I LG I GTVLDQAETAGARLVVLATAT 
PPGSVTVPHPN I EEVALSTTGE 1 PFYGKA I PLEV I KGGRHL I FCHSKKKC-1400 
DELAAKLVALGINAVAYYRGLDVSVIPTSGDVVVVATDALMTGYTGDFDS 

Y (S) 
V I DCNTCVTQTVDFSLDPTFT I ET 1 TLPQDAVSRTQRRGRTGRGKPG I YR- 1500 
FVAPGERPSGMFDSSVLCECYD AGC AWYELTPAETTVRLRA YMNTPGLPV ^ nn 
CQDHLEFWEGVFTGLTHIDAHFLSQTKQSGENLPYLVAYQATVCARAQAP-1600 
PPSWDQMWKCLIRLKPTLHGPTPLLYRLGAVQNEITLTHPVTKY MTCMS 
ADLEVVTSTWVLVGGVLAALAAYGLSTGCVVIVGRWLSGKPAI IPDREV-1700 
LYREFD.EMEECSQHLPYIEQGMMLAEQFKQKALGLLQTASRQAEVIAPAV iortft 
QTNWQKLETFWAKHMWNF I SG I QYL AGLSTLPGNPA I ASLMAFT A A VTSP- 1800 
LTTSQTLLFN I LGGWVAAQLAAPGAATAFVGAGLAGAA I GSVGLGKVL I D 



FIG. 66B 



(G) 

ILAGYGAGVAGALVAFKIMSGEVPSTEDLVNLLPAILSPGALVVGVVCAA-1900 

(HO 

ILRRHVGPGEGAVQWMNRLIAFASRGNHVSPTHYVPESDAAARVTAILSS 
LTVTQLLRRLHQWISSECTTPCSGSWLRDIWDWICEVLSDFKTWLKAKLM-2000 

(V) 

PQLPGIPFVSCQRGYKGVWRGDGIMHTRCHCGAEITGHVKNGTMRIVGPR 

TCRNMWSGTFPINAYTTGPCTPLPAPNYTFALWRVSAEEYVEIRQVGDFH-2100 

YVTGMTTDNLKCPCQVPSPEFFTELDGVRLHRFAPPCKPLLREEVSFRVG 

LHEYPVGSQLPCEPEPDVAVLTSMLTDPSHITAEAAGRRLARGSPPSVAS-2200 

SSASQLSAPSLKATCTANHDSPDAELIEANLLWRQEMGGNITRVESENKV 

VI LDSFDPLVAEEDEREI SVPAEI LRKSRRFAQALPVWARPDYNPPLVET-2300 

S 

WKKPDYEPPVVHGCPLPPPKSPPVPPPRKKRTVVLTESTLSTALAELATR 

(FA) 

SFGSSSTSGIT6DNTTTSSEPAPSGCPPDSDAESYSSMPPLEGEPGDPDL-2400 
SDGSWSTVSSEANAED WCCSMSYSWTGALVTPCAAEEQKLP I NALSNSL 
LRHHNLVYSTTSRSACQRQKKVTFDRLQVLDSHYQDVLKEVKAAASKVKA-2500 

(F) 

NLLSVEEACSLTPPHSAKSKFGYGAKDVRCHARKAVTHINSVWKDLLEDN 

VTPIDTTIMAKNEVFCVQPEKGGRKPARLIVFPDLGVRVCEKMALYDVVT-2600 

KLPLAVMGSSYGFQYSPGQRVEFLVQAWKSKKTPMGFSYDTRCFDSTVTE 

(G) 

SD I RTEEA I YQCCDLDPQA RVA I KSLTERLYVGGPLTNSRGENCGYRRCR-2700 
ASGVLTTSCGNTLTCY I KARAACRAAGLQDCTMLVCGDDL VV I CESAGVQ 
EDAASLRAFTEAMTRYSAPPGDPPQPEYDLELITSeSSNVSVAHDGAGKR-2800 
VYYLTRDPTTPLARAAWETARHTFVNSWLGNIIMFAPTLWARMILMTHFF 
SVLIARDQLEQALDCEIYGACYSIEPLDLPPI IQRLHGLSAFSLHSYSPG-2900 

G 

EINRVAACLRKLGVPPLRAWRHRARSVRARLLARGGRAAICGKYLFNWAV 
RTKLK — (Stop codon not yet reached) 



( ) = Heterogeneity due to possible 5' or 3' 
terminal cloning artefacts. 
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FIG. 47B 

PhePheCysPheAlaTrpTyrLeuLysGlyLysTrpValProGlyAlaValTyrihrPhe 
961 TGnCTTCT6CTTT6CATG6TAhT6AAGGGT 

ACAAGAAGACGAAACGTACCATAAACTTCCCATTCACCCACGGGCCTCGCCAGATGTGGA 

Ty rG lyMetTrpProLeuLeuLeuLeuLeuLeuA 1 aLeuP roG 1 nA rgA 1 aTy rA 1 aLeu 
1021 TCTACGGGATGTGbCCTCTCCTCCTGCTCCTGTTGGCGTTGCCCCAGCGGGCGTACGCGC 
AGATGCCCTACACCGGAGAGGAGGACGAGGACAACCGCAACGGGGTCGCCCGCATGCGCG 

AspTh rG 1 uVa 1 A 1 aA 1 aSe rCysG 1 yG 1 yVa 1 Va 1 LeuVa 1 G 1 yLeuMe t A 1 aLeuThr 
1081 TGGACACGGAGGTGGCCGCGTCGTGTGGCGGTGTTGnCTCGTCGGGTTGATGGCGCTGA 
ACCTGTGCCTCCACC6GCGCAGCACACCGCCACAACAAGAGCAGCCCAAGTACCGCGACT 

LeuSerProTyrTyrLysArgTyrl leSerTrpCysLeuTrpTrpLeuGlnTyrPheLeu 
ii£ii ctctgtcaccataHacaagcgctatatcagctggtgcttgtggtggcttcagtattt^ 

GAGACAGTGGTATAATGTTCGCGATATAGTCGACCACGAACACCACCGAAGTCATAAAAG 

ThrA rgVa 1 G 1 uA 1 aG 1 nLeuH i s Va lTrp 1 1 eProProLeuAsnVa 1 ArgGlyGlyArg 
1201 TGACCAGAGTGGAAGCGCAACTGCACGTGTGGATTCCCCCCCTCAACGja 

ACTGGTCTCACCTTCGCGTTGACGTGCACACCTAAGGGGGGGAGTTGCAGGCTCCCCCCG 

AspA laVa 1 1 1 eLeuLeuMe tCys A 1 aVa 1 H i sProThrLeuVa 1 PheAsp 1 1 eThrLys 
1261 GCGACGCCGTCATCTTACTCATGTGTGCTGTAGACCCGACTCTGGTAT^ 

CGCTGCGGCAGTAGAATGAGTACACACGACATGTGGGCTGAGACCATAAACTGTAGTGGT 

LeuLeuLeuAlaValPheGlyProLeuTrpI leLeuGlnAlaSerLeuLeuLysValPro 
1321 AATTGCTGCTGGCCGTCTTCGGACCCCTTTGGATTCTTCAAG^ 

TTAACGACGACCGGCAGAAGCCTGGGGAAACCTAAGAAGTTCGGTCAAACGAATTTCATG 

Ty rPheVa 1 A rgVa lGlnGl yLeuLeuA rgPheCys A laLeuA 1 aA rgLysMet I leG ly 
1381 CCTACTTTGTGCGCGTCCAAGGCCTTCTCCGGTTCTGCGCGTTAGC^ 

GGATGAAACACGCGCAGGTTCCGGAAGAGGCCAAGACGCGCAATCGCGCCTTCTACTAGC 

GlyHisTyrValGlnMetValllelleLysLeuGlyAlaLeuThrGlyThrTyrValTyr 
mm GAGGCCATTACGTGCAAATGGTCATCATTAAGTTAGGGGCGCT^ 

CTCCGGTAATGCACGTTTACCAGTAGTAATTCAATCCCCGCGAATGACCGTGGATACAAA 

AsnH i sLeuThrProLeuA rgAspTrpA 1 aH i sAsnG lyLeuArgAspLeuA 1 aValA 1 a 
1501 ATAACCATCTCACTCCTCTTCGGGACTGGGCGCACAACGGCTTGCGAGATCT 

TATTGGTAGAGTGAGGAGAAGCCCTGACCCGCGTGTTGCCGAACGCTCTAGACCGGCACC 

Va 1 G 1 uPro va 1 Va 1 PheSe rG 1 nMetG 1 uTh rLy sLeu 1 1 eTh rTr pG 1 yA 1 aaspThr 
1561 CTGTAGAGCCAGTCGTCTTCTCCCAAATGGAGACCAAGCTCATCACGTGGGGGG^ 

GACATCTCGGTCAGCAGAAGAGGGTTTACCTCTGGTTCGAGTAGTGCACCCCCCGTCTAT 

AlaAlaCysGlyAspI lei leAsnGlyLeuProVal^erAlaArgArgGlyA^ le 
1621 CCGCCGCGTGCGGTGACATCATCAACGGCTTGCCTGTTTCCGCCCGC^ 

GGCGGCGCACGCCACTGTAGTAGTTGCCGAACGGACAAAGGCGGGCGTCCCCGGCCCTCT 

LeuLeuG 1 y ProA 1 aAspG 1 yfle tVa 1 Se rLy sGJ^Tr pA rgLeuLeuAl aP ro 1 1 eTh r 
1681 TACTGCTCGGGCCAGCCGATGGAATGGTCTCCAAGGGGTGGAGGTTGCTGGCG^ 

ATGACGAGCCCGGTCGGCTACCTTACCAGAGGTTCCCCACCTCCAACGACCGCGGGTAGT 

AlaTyrAlaGlnGlnThrArgGlyLeuLeuGlyCysIl^ 
1741 CGGCGTACGCCCAGCAGACAAGGGGCCTCCTAGGGTGCATAATCACCAG^ 

GCCGCATGCGGGTCGTCTGTTCCCCGGA6GATCCCACGTATTAGTGGTCGGATTGACCGG 

AspLys AsnG 1 nVa 1 G 1 u6 1 yG 1 uVa 1 G 1 n 1 1 e Va ISerTh rA 1 aA 1 aG 1 nThr PheLeu 
1801 GGGACAAAAACCAAGTGGAGGGTGAGGTCCAGATTGTGTCAACTG^ 

CCCTGTTTTTGGTTCACCTCCCACTCCA6GTCTAACACAGTTGACGACGGGTTTGGAAGG 

A 1 aThr Cvs 1 1 e AsnG 1 y Va 1 CysTrpTh rVa 1 Ty r H 1 sG 1 y AlaGlyThrArgThrl le 
1861 TGGCAACGTbCATCAATGGGGTGTbCTGGACTGTCTACCACGGGGCCGGAACGAGGACCA 
ACCGTTGCACGTAGTTACCCCACACGACCTGACAGATGGTGCCCCG6CCTTGCTCCTGGT 
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FIG. 47C 

TrpProA 1 aProGl nG 1 ySe rArgSerLeuTh rProCysThrCysG lySerSer AspLeu 

20*1 Ti«m^^ 

AAATGGACCAGTGCTCCGTGCGGCTACAGTAAGGGCACGCGGCCGCCCCACTATCGTCCC 

SerLeuLeuSerProArgProUeSerTyr^ 
2101 GCAGCCTGCTGTCGCCCCGGCCCATTTCCTACTTGAAAGGCTCC^ 

CGTCGGACGACAGCGGGGCCGGGTAAAGGATGAACTTTCCGAGGAGCCCCCCAGGCGACA 

CysProAlaGlyHisAlaValGlyllePheArgAlaAlaValCysThrArgGlyValAla 

2221 att&ffifMilBM 

GATTCCGCCACCTGAAATAGGGACACCTCTTGGATCTCTGTTGGTACTCCAGGGGCCACA 

ThrAsDAsnSerSerProProValValProGlnSerPheGlnValAlaHisLeuHisAla 

2281 TCACGGA?AACTCCTCTCCACCACT 

AGTGCCTATTGAGGAGAGGTGGTCATCACGGGGTCTCGAAGGTCCACCGAGTGGAGGTAC 

ProThrGlySerGlyLysSerThrLysValProAlaAlaTyrAlaAlaGlnGlyTyrLys 

2341 CTCCCACAGGCAGCGGCAAAAGCACCAAGGTC^ 

GAGGGTGTCCGTCGCCGTTTTCGTGGTTCCAGGGCCGACGTATACGTCGAGTCCCGATAT 

Vail pnValLeuAsnProSerValAlaAlaThrLeuGlyPheGlyAlaTyrMetSerLys 

2401 AGGTGCT AGTACTCAACCCCTCTGT^ 

TCCACGATCATGAGTTGGGGAGACAACGACGTTGTGACCCGAAACCACGAATGTACAGGT 

AlaHisGlylleAspProAsnlleArgTtirGlyValArgThrlleThrThrGlySerPro 
IleThrTvrSerThrTyrGlyLysPheLeuAlaAspGlyGlyCysSerGlyGlyAlaTyr 
Aspl 1 e I le I leCysAspG 1 uCysH i -sSe rTh-rAspA 1 aTh rSe rl leLeuGlylleGly 

26« gcactgtccttgaBcaagcagag^ 
cgtgacaggaactggttcgtck^ 

ProProGlySerValThrValProHisProAsnlleGluGluValAlaLeuSerThrThr 
2701 CCCCTCCGGGCTCCGTCAC^TGCCCCATCCCAACATCG 
GGGGAGGCCCGAGGCAGTGAW 

fil vfiiuT l pProPheTv rG 1 yLysA 1 a 1 1 eP roLeuG 1 uVa 1 1 1 eLysG 1 yG 1 yA rgH i s 

2821 ATCTCMCncflifcATOAAjtG^ 

TAGAGTAGAAGACAGTAAGTTTCTTCTTCACGCTGCTTGAGCGGCGTTTCGACCAGCGTA 

GivlleAsnAlaValAlaTyrTyrArgGlyLeuAspValSerVall leProThrSerGly 

AsDValValValValAlaThrAspAlaLeuMetThrGlyTyrThrGlyAspPheAspSer 

2Mi ^X^mms^mmmmm 



FIG. 47D 

Val I leAspCysAsnThrCysValThrGlnThrValAsgPheSerLeuAspProThrPhe 
3001 CG6TGATAGACTGCAATACGT6TGTCACCCAGACAGTCGATTTCAGCCTTGACCCTACCT 
GCCACTATCTGACGTTATGCACACAGTGGGTCTGTCAGCTAAAGTCGGAACTGGGATGGA 

3061 tcaccattgagacaUcacgctcccc^ 

agtggtaactctgttagtgcgagggggtcctacgacagagggcgtgagttgcagccccgt 

ThrGlyArgGlyLysProGlylleTyrArgPheValAlaProGlyGluArgProSerGly 
3121 GGACTGGCAGGGGGAAGCCAGGCATCTACAGATTTGTGGCACCGGGGGAGCGCCCCTCCG 
CCTGACCGTCCCCCTTCGGTCCGTAGATGTCTAAACACCGTGGCCCCCTCGCGGGGAGGC 

MetPheAspSerSerVa 1 LeuCysG 1 uCysTyrAspA 1 aG lyCysA 1 aTrpTy rG 1 uLeu 
3181 GCATGTTCGACTC6TCCGTCCTCTGTGAGTGCTATGACGCAGGCTGTGCTTGGTATGAGC 
CGTACAAGCTGAGCAGGCAGGAGACACTCACGATACTGCGTCCGACACGAACCATACTCG 

ThrProAlaGluThrThrValArgLeuArgAlaTyrMetAsnThrProGlyLeuProVal 
3241 TCACGCCCGCCGAGACTACAGTTAGGCTACGAGCGTACATGAACACCCCGGGGCTTCCCG 
AGTGCGGGCGGCTCTGATGTCAATCCGATGCTCGCATGTACTTGTGGGGCCCCGAAGGGC 

CysGlnAspHisLeuGluPheTrpGluGlyValPheThrGlyLeuThrHisIleAspAla 
3301 TGTGCCAGGACCATCTTGAATTTTGGGAGGGCGTCTTTACAGGCCTCACTCATATAGATG 
ACACGGTCCTGGTAGAACTTAAAACCCTCCCGCAGAAATGTCCGGAGTGAGTATATCTAC 

HisPheLeuSerGlnThrLysGlnSerGlyGluAsnLeuProTyrLeuValAlaTyrGln 
3361 CCCACTTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACC 
GGGTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGCATGG 

A 1 aTh rVa 1 Cys A 1 aA rgA 1 aG 1 nA 1 aProProProSerTrpAspG 1 nMetTrpLysCys 
3421 AAGCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGT 
TTCGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCA 

LeuIleArgLeuLysProThrLeuHisGlyProThrProLeuLeuTyrArgLeuGlyAla 
3481 GTTTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTGGGCG 
CAAACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGACCCGC 

Va 1 G 1 nAsnG 1 u 1 1 eTh r LeuTh rH i sP roVa ITh rLy sT-y r 1 1 eMe tThrCysMe tSe r 
3541 CTGTTCAGAATGAAATCACCCTGACGCACCCAGTCACCAAATACATCATGACATGCATGT 
GACAAGTCTTACTTTAGTGGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACGTACA 

AlaAspLeuGluValValThrSerThrTrpValLfeuValGlyGlyValLeuAlaAlaLeu 
3601 CGGCCGACCTGGAGGTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTT 
GCCGGCTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGACGAA 

A 1 a A 1 aTy r Cy s Le uSe rTh r G 1 yCy s Va 1 Va 1 1 1 eVa 1 G 1 y A rgVa 1 Va 1 LeuSe rG 1 y 
3661 TGGCC6CGTATTGCCTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTGTCCG 
ACCGGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAACAGGC 

LysProAlallelleProAspArgGluValLeuTyrArgGluPheAspGluMett 
3721 GGAAGCCGGCAATCATACCTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGA^ 

CCTTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTACCTTC 

CysSerG 1 nH I sLeuProTy r 1 1 eG 1 uG InG 1 yMetMetLeuA 1 aG 1 uG InPheLysG 1 n 
3781 AGTGCTCTCAGCACTTACCGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAA^ 
TCACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAGTTCG 

LysA 1 aLeuG lyLeuLeuG 1 nThrA 1 aSerA rgG 1 nA 1 aG 1 uVa 1 1 1 eA 1 aProA 1 aVa 1 
3841 AGAAGGCCCTCGGCCTCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTG 
TCTTCCG6GAGCCGGAG6ACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGACGAC 

GlnThrAsnTrpGlnLysLeuGluThrPheTrpAlaLysHisMetTrpAsnPhelleSer 
3901 TCCAGACCAACTGGCAAAAACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCA 
AGGTCTGGTTGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGT 

GlylleGlnTyrLeuAlaGlyLeuSerThrLeuProGlyAsnProAlalleAlaSerLeu 
3961 GTGGGATACAATACTTGGCGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGCTTCAT 
CACCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTA 
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FIG- 47E 

MetAlaPheThrAlaAlaValThrSerProLeuThrThrSerGlnThrLeuLeuPheAsn 
4021 T6AT66CTTTTTACAGCT6CT6TCACCAGCCCACTAACCACTAGCCAAACC 
ACTACCGAAAATGTCGACGACAGTGGTCGGGTGATTGG 

1 1 eLeuG 1 yG 1 yTrpVa 1 A 1 aA 1 aG 1 nLeu A 1 aA 1 aP roG lyA 1 aA 1 aTh fA 1 aPheVa 1 
4081 ACATATTG^GGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTA 
TGTATAACCCCCCCACCCACCGACGGGTCGAG 

G 1 yA 1 aG 1 yLeu A 1 aG 1 yAl aA 1 a 1 1 eG 1 ySe rVa 1 G 1 yLeuG 1 yLysVal Leu 1 1 eAsp 
4141 TGGGCGCTGGCTTAGCTGGCGCCGCCATC^ 

ACCCGCGACCGAATCGACCGCGGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATC 

Tlpi puAlaGlvTvrGlyAlaGlyValAlaGlyAlaLeuValAlaPheLysIleMetSer 

4201 ACATCCTTGCAGGGTATGGC^ 

TGTAGGAACGTCCCATACCGCGC^ 

G 1 yG 1 uVa 1 ProSe rTh rG 1 uAspLeuVa 1 AsnLeuLeuProA 1 a 1 1 eLeuSerP roG 1 y 

4261 gcggTgaggtcccctccaUgaggacctggtc^^^^ 

cgccactccaggggaggtgcctcctggaccagttagatgacgggcggtaggagagcgggc 

AlaLeuValValGlyValValCysAlaAlalleLeuArgArgHisValGlyProGlyGlu 
GAGCCCTCGTAGTCGGCGTGGTCTG^ 

CTCGGGAGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGC 

GlyAlaValGlnTrpMetAsnArgLeuIleAlaPheAlaSerArgGlyAsnHi^ 

4381 AGGGGGCAGTGCAGTGGATGAACCGGCTGATAGCCn 

TCCCCCGTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAA 

P roTh rH i sTy rVa 1 ProG 1 uSe r A spA 1 aA 1 aA 1 a A rgVa 1 Th rA 1 a 1 1 eLeuSe rSe r 
4441 CCCCCACGCACTACGTGCCGGAGAGCGATGCAGCTGCCCGCGTCACTGCCATACTCAGCA 

GGGGGTGCGTGATGCACGGCCT 

LeuTh rVa 1 Th rG 1 nLeuLeuA r g A rgLeuH 1 sG 1 nTrp II eSe rSe rG 1 uCysThrThr 
4501 GCCTCACTGTAACCCAGCTCCTGAGGCGACT^ 

CGGAGTGACATTGGGTC6AGGACTCC6CTGACGTGGTCACCTATTCGAGCCTCACATGGT 

Profv^SerGlvSerTrDLeuArgAspI leTrpAspTrpI leCysGluValLeuSerAsp 
4561 CTCCATGCTCCGGTTCaGGCTAAGG^ 
GAGGTACGAGGCCAAGG^ 

PheLysThrTrpLeuLysAlaLysLeuMetProGlnLeuProGlyneProPheV^ 

4621 actttaagacctggctaaaag 

tgaaattctggaccUttttcgattcga 

Pv«;GlnAraGlvTvrLysGlyValTrpArgValAspGlyIleMetHisThrArgCysHis 

4681 CCTGCCAGCG^GGGTATAAGGGG^ 

GGACGGTCGCGCCCATATTCCCCCAGACCGCTCACCTGCCGTAGTACGTGTGAGCGACGG 

CvsG l yA 1 aG 1 u 1 1 eTh rG 1 yH i s Va 1 Ly s AsnG 1 yTh rMe t A r g 1 1 eVa 1 G 1 y P r o A r g 
ThrCvsAraAsnMetTrpSerGlyThrPheProI leAsnAlaTyrThrThrGlyProCys 

CATGGGGGGAAGGACGCGGCTTGATGTGCAAGCGCGATACCTCCCACAGACGTCTCCTTA 

TACACCTCTATTCCGTCCACCCCCTGAAGGTGATGCACTGCCCATACTGATGACTGTTAG 
LysCysProCysG 1 nVa 1 ProSe rProGl uPhePheTh^ 

4981 mimiMMmmsMMJiBmsm 



FIGi 47 F 

" HisA raPheA 1 aProProCy sLysProLeuLeuArgG 1 uG 1 uVa 1 Se rPheA rgVa 1 G 1 y 
5041 TACATAGGTTTGCGCCCCCCTGCAAG^ 

ATGTATCCAAACGCGGGGGGACGTTCGGGA 

LeuHisGluTyrProValGlySerGlnLeuProCysGluProGluProAspValAlaVal 

5101 mm^smmmsmmsmmm 
5 i6i TsKsssassa^ 

ACAACTGCAGGTACGAGTGACTAGGGAGGGTATATTGTCGTCTCCGCCGGCCCGCTTCCA 

AlaArgGlySerProProSerValAlaSerSerSerAlaSerGlnLeuSerAlaP^ 
5221 TGGCGAGGGGATCACCCCCCTCTGTGGCCAGCTCCTCGGCJAGCCA^ 

ACCGCTCCCCTAGTGGGGGGAGACACCGGTCGAGGAGCCGATCGGTCGATAGGCGAGGTA 

LeuLysAlaThrCysThrAlaAsnHisAspSerProAspAlaGluLeuI leGluAlaAsn 
5281 CTCTCAAGGCAACTTGCACCGCTAACCATGACTCCCCT 

GAGAGTTCCGTTGAACGTGGCGATTGGTACTGAGGGGACTACGACTCGAGTATCTCCGGT 

LeuLeuTrpA rgG 1 nG 1 uMe tG 1 yG 1 yAsn 1 1 eTh r A rgVa 1G 1 uSe rG 1 uA snLysVa 1 

ACCACTAAGACCTGAGGAAGCTAGGCGAACACCGCCTCCTCCTGCTCGCCCTCTAGAGGC 

ProAlaGluIleLeuArgLysSerArgArgPheAlaGlnAlaLeuP™^ 
5461 TACCCGCAGAAATCCTGCGGAAGTCTCGGAGATTCGCCCAGGCCCTGC^ 

ATGGGCGTCTTTAGGACGCCTTCAGAGCCTCTAAGCGGGTCCGGGACGGGCAAACCCGCG 

ProAspTyrAsnProProLeuYalGluThrTrpLysLysProAspTyrGluProProVal 
5521 GGCCGGACTATAACCCCCCGCTAGTGGAGACGTGGAAAAAGCCCGACTACGAACCA^ 

CCGGCCTGATATTGGGGGGCGATCACCTCTGCACCTTTTTCGGGCTGATGCTTGGTGGAC 

Va 1H i sG lyCysProLeuProProProLysSerProProVa 1 ProPrpProA rgLysLys 
5581 TGGTCCATGGCTGTCCGCTTCCACCTCCAAAGTCCCCTCCTG 

ACCAGGTACCGACAGGCGAAGGTGGAGGTTTCAGGGGAGGACACGGAGGCGGAGCCTTCT 

ArgThrValValLeuThrGluSerThrLeuSerThrAlaLeuM^ 
5641 AGCGGACGGTGGTCCTCACTGAATCAACCCTATCTACTGC^ 

TCGCCTGCCACCAGGAGTGACTTAGTTGGGATAGATGACGGAACCGGCTCGAGCGGTGGT 

SerPheGlySerSerSerThrSerGlylleT^ 
5701 GAAGCTTTGGCAGCTCCTCAACTTCC6GCATTACGGGCGACAATACGACAACATCCTCTG 
CTTCGAAACCGTCGAGGAGTTGAAGGCCGTAATGCCCGCTGTTATGCTGTTGTAGGAGAC 

ProA 1 aProSerG 1 yCysProProAspSerAspA laG 1 uSerTy rSe rSerMetProPro 
5761 AGCCCGCCCCTTCTGGCTGCCCCCCCGACTCCGACGCTGAGTCCTATTCCTCCAT 

TCGGGCGGGGAAGACCGACGGGGGGGCTGAG6CT6CGACTCAGGATAAGGAGGTACGGGG 

LeuGluGlyGluProGlyAspProAspLeuSerAspGlySerTrpSerTh^alSerSer 
5821 CCCTGGAGGGGGAGCCTGGbGATCCGGATCTTAGCGACGGGTCATGGTCAACG^ 

GGGACCTCCCCCTCGGACCCCTAGGCCTAGAATCGCTGCCCAGTACCAGTTGCCAGTCAT 

G I uA 1 aAsnA laG 1 uAspVa 1 Va ICysCysSerMetSerTy rSe rTr pThrG 1 yA laLeu 
5881 GTGAGGCCAACGCGGAGGATGTCGTGTGCTGCTCAATGTCT^ 

CACTCCGGTTGCGCCTCCTACAGCACACGACGAGTTACAGAATGAGAACCTGTCCGCGTG 

ValThrProCysAlaAlaGluGluGlnLysLeuProIleAsnA^ 
5941 TCGTCACCCCGTGCGCCGCGGAAGAACAGAAACTGCCCATCAATGCACTAAGCAACTCGT 
AGCAGTGGGGCACGCGGCGCCTTCTTGTCTTTGACGGGTAGTTACGTGATTCGTTGAGCA 

LeuArgHisHisAsnLeuValTyrSerThrThrSerArgSerAlaCysGlnArgGlnLys 
6001 TGCTACGTCACCACAATTTGGTGTATTCCACCACCTCACGCAGTGCTTGCCAAAGGCAGA 
ACGATGCAGTGGTGTTAAACCACATAAGGTGGTGGAGTGCGTCACGAACGGTTTCCGTCT 



FIG. 47G 

Lys Va 1 Th r PheAspArgLeuG 1 nVa 1 LeuAspSerH i sTy rG 1 nAspVa 1 LeuLysG 1 u 
6061 AGAAAGTCACATTTGACAGACTGCAAGTTCTGGACAGCCATTACCAGGACGTACTCAAGG 
TCTTTCAGTGTAAACTGTCTGACGTTCAAGACCTGTCGGTAATGGTCCTGCATGAGTTCC 

ValLysAlaAlaAlaSerLysValLysAlaAsnLeuLeuSerValGluGluAlaCysSer 
6121 AGGTTAAAGCAGCGGCGTCAAAAGTGAAGGCTAACTTGCTATCCGTAGAGGAAGCTTGCA 
TCCAATTTCGTCGCCGCAGTTTTCACTTCCGATTGAACGATAGGCATCTCCTTCGAACGT 

LeuThrProProHisSerAlaLysSerLysPheGlyTyrGlyAlaLysAspValArgCys 
6181 GCCTGACGCCCCCACACTCAGCCAAATCCAAGTTTGGTTATGGGGCAAAAGACGTCCGTT. 
CGGACTGCGGGGGTGTGAGTCGGTTTAGGTTCAAACCAATACCCCGTTTTCTGCAGGCAA 

HisAlaArgLysAlaValThrHisI leAsnSerValTrpLysAspLeuLeuGluAspAsn 
6241 "GCCATGCCAGAAAGGCCGTAACCCACATCAACTCCGTGTGGAAAGACCTTCTGGAAGACA 
CGGTACGGTCTTTCCGGCATTGGGTGTAGTTGAGGCACACCTTTCTGGAAGACCTTCTGT 

ValThrProIleAspThrThrlleMetAlaLysAsnGluValPheCysValGlnProGlu 
6301 ATGTAACACCAATAGACACTACCATCATGGCTAAGAACGAGGTTTTCTGCGTTCAGCCTG 
TACATTGTGGTTATCTGTGATGGTAGTACCGATTCTTGCTCCAAAAGACGCAAGTCGGAC 

LysGlyGlyArgLysProAlaArgLeuIleValPheProAspLeuGlyValArgValCys 
6361 AGAAGGGGGGTCGTAAGCCAGCTCGTCTCATCGTGTTCCCCGATCTGGGCGTGCGCGTGT 
TCTTCCCCCCAGCATTCGGTCGAGCAGAGTAGCACAAGGGGCTAGACCCGCACGCGCACA 

GluLysMetAlaLeuTyrAspValValThrLysLeuProLeuAlaValMetGlySerSer 
6121 GCGAAAAGATGGCTTTGTACGACGTGGTTACAAAGCTCCCCTTGGCCGTGATGGGAAGCT 
CGCTTTTCTACCGAAACATGCTGCACCAATGTTTCGAGGGGAACCGGCACTACCCTTCGA . 

TyrGlyPheGlnTyrSerProGlyGlnArgValGluPheLeuValGlnAlaTrpLysSer 
6181 CCTACGGATTCCAATACTCACCAGGACAGCGGGTTGAATTCCTCGTGCAAGCGTGGAAGT 
GGATGCCTAAGGTTATGAGTGGTCCTGTCGCCCAACTTAAGGAGCACGTTCGCACCTTCA 

LysLysThrProMetGlyPheSerTyrAspThrArgCysPheAspSerThrValThrGlu 
6541 CCAAGAAAACCCCAATGGGGTTCTCGTATGATACCCGCTGCTTTGACTCCACAGTCACTG 
GGTTCTTTTGGGGTTACCCCAAGAGCATACTATGGGCGACGAAACTGAGGTGTCAGTGAC 

SerAspI leArgThrGluGluAlal leTyrGlnCysCysAspLeuAspProGlnAlaArg 
6601 AGAGCGACATCCGTACGGAGGAGGCAATCTACCAATGTTGTGACCTCGACCCCCAAGCCC 
TCTCGCTGTAGGCATGCCTCCTCCGTTAGATGGTTACAACACTGGAGCTGGGGGTTCGGG 

ValAlalleLysSerLeuThrGluArgLeuTyrValGlyGlyProLeuThrAsnSerArg 
6661 GCGTGGCCATCAAGTCCCTCACCGAGAGGCTTTATGTTGGGGGCCCTCTTACCAATTCAA 
CGCACCGGTAGTTCAGGGAGTGGCTCTCCGAAATACAACCCCCGGGAGAATGGTTAAGTT 

GlyGluAsnCysGlyTyrArgArgCysArgAlaSerGlyValLeuThrThrSerCysGly 
6721 GGGGGGAGAACTGCGGCTATCGCAGGTGCCGCGCGAGCGGCGTACTGACAACTAGCTGTG 
CCCCCCTCTTGACGCCGATAGCGTCCACGGCGCGCTCGCCGCATGACTGTTGATCGACAC 

AsnTh rLeuThrCysTy r 1 1 eLysA 1 aA rgAl aA 1 aCysArgA 1 aA laG lyLeuG 1 nAsp 
6781 GTAACACCCTCACTTGCTACATCAAGGCCCGGGCAGCCTGTCGAGCCGCAGGGCTCCAGG 
CATTGTGGGAGTGAACGATGTAGTTCCGGGCCCGTCGGACAGCTCGGCGTCCCGAGGTCC 

CysThrMetLeuVa 1 CysG 1 y AspAspLeuVa 1 Va 1 1 1 eCy sG 1 use r A 1 aG 1 y Va 1 G 1 n 
6841 ACTGCACCATGCTCGTGTGTGGCGACGACTTAGTCGTTATCTGTGAAAGCGCGGGGGTCC 
TGACGTGGTACGAGCACACACCGCTGCTGAATCAGCAATAGACACTTTCGCGCCCCCAGG 

GluAspAlaAlaSerLeuArgAlaPheThrGluAlaMetThrArgTyrSerAlaProPro 
6901 AG6AGGACGCGGCGAGCCTGAGAGCCTTCACGGAGGCTATGACCAGGTACTCCGCCCCCC 
TCCTCCTGCGCCGCTCGGACTCTCGGAAGTGCCTCCGATACTGGTCCATGAGGCGGGGGG 

GlyAspProProGlnProGluTyrAspLeuGluLeulleThrSerCysSerSerAsnVal 
6961 CTGGGGACCCCCCACAACCAGAATACGACTTGGAGCTCATAACATCATGCTCCTCCAACG 
GACCCCT6GGGGGTGTTGGTCTTATGCTGAACCTCGAGTATTGTAGTACGAGGAGGTTGC 

SerValAlaHisAspGlyAlaGlyLysArgValTyrTyrLeuThrArgAspProThrThr 
7021 TGTCAGTCGCCCACGACGGCGCTGGAAAGAGGGTCTACTACCTCACCCGTGACCCTACAA 
ACAGTCAGCGGGTGCTGCCGCGACCTTTCTCCCAGATGATGGAGTGGGCACTGGGATGTT 
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FIG. 54A 

ArgSerArgAsnLeuGlyLysVall leAspThrLeuThrCysGlyPheAlaAspLeuMet 
1 AGGTCGCGCAATTTGG6TAAG6TCATC6ATACCCTTACGT6CGGCTTC6CC6ACCTCAT6 
TCCAGCGCGTTAAACCCATTCCAGTAGCTATGGGAATGCACGCCGAAGCGGCTGGAGTAC 

GlyTyrlleProLeuValGlyAlaProLeuGlyGlyAlaAlaArgAlaLeuAlaHisGly 
61 GGGTACATACCGCTCGTCGGCGCCCCTCTTGGAGGCGCTGCCAGGGCCCTGGCGCATGGC 
CCCATGTATGGCGAGCAGCCGCGGGGAGAACCTCCGCGACGGTCCCGGGACCGCGTACCG 

ValArgValLeuGluAspGlyValAsnTyrAlaThrGlyAsnLeuProGlyCysSerPhe 
121 GTCCGGGTTCTGGAAGACGGCGTGAACTATGCAACAGGGAACCTTCCTGGTTGCTCTTTC 
CAGGCCCAAGACCTTCTGCCGCACTTGATACGTTGTCCCTTGGAAGGACCAACGAGAAAG 

Se r 1 1 ePheLeuLeuA 1 aLeuLeuSe rCysLeuTh rVa 1 ProA 1 aSe r A 1 aTy rG 1 nVa 1 
181 TCTATCTTCCTTCTGGCCCTGCTCTCTTGCTTGACTGTGCCCGCTTCGGCCTACCAAGTG 
AGATAGAAGGAAGACCGGGACGAGAGAACGAACTGACACGGGCGAAGCCGGATGGTTCAC 

ArgAsnSerThrGlyLeuTyrHisValThrAsnAspCysProAsnSerSerlleValTyr 
241 CGCAACTCCACGGGGCTTTACCACGTCACCAATGATTGCCCTAACTCGAGTATTGTGTAC 
GCGTTGAGGTGCCCCGAAATGGTGCAGTGGTTACTAACGGGATTGAGCTCATAACACATG 

G 1 uA 1 aA 1 a AspA 1 a 1 1 eLeuH i sTh rProG lyCysVa 1 ProCys Va 1 ArgG 1 uG 1 yAsn 
301 GAGGCGGCCGATGCCATCCTGCACACTCCGGGGTGCGTCCCTTGCGTTCGTGAGGGCAAC 
CTCCGCCGGCTACGGTAGGACGTGTGAGGCCCCACGCAGGGAACGCAAGCACTCCCGTTG 

AlaSerArgCysTrpValAlaMetThrProThrValAlaThrArgAspGlyLysLeuPro 
361 GCCTCGAGGTGTTGGGTGGCGATGACCCCTACGGTGGCCACCAGGGAT6GCAAACTCCCC 
CGGAGCTCCACAACCCACCGCTACTGGGGATGCCACCGGTGGTCCCTACCGTTTGAGGGG 

AlaThrGlnLeuArgArgHisIleAspLeuLeuValGlySerAlaThrLeuCysSerAla 
421 GCGACGCAGCTTCGACGTCACATCGATCTGCTTGTCGGGAGCGCCACCCTCTGTTCGGCC 
CGCTGCGTCGAAGCTGCAGTGTAGCTAGACGAACAGCCCTCGCGGTGGGAGACAAGCCGG 

LeuTyrValGlyAspLeuCysGlySerValPheLeuValGlyGlnLeuPheThrPheSer 
481 CTCTACGTGGGGGACCTATGCGGGTCTGTCTTTCTTGTCGGCCAACTGTTCACCTTCTCT 
GAGATGCACCCCCTGGATACGCCCAGACAGAAAGAACAGCCGGTTGACAAGTGGAAGAGA 

ProArgArgHisTrpThrThrGlnGlyCysAsnCysSerlleTyrProGlyHisIleThr 
541 CCCAGGCGCCACTGGACGACGCAAGGTTGCAATTGCTCTATCTATCCCGGCCATATAACG 
GGGTCCGCGGTGACCTGCTCCGTTCCAACGTTAACGAGATAGATAGGGCCGGTATATTGC 

GlyHisA rgMetA 1 aTrpAspMetMetMetAsnTrpSerProTh rThr A 1 aLeuVa lMet 
601 GGTCACCGCATGGCATGGGATATGATGATGAACTGGTCCCCTACGACGGCGTTGGTAATG 
CCAGTGGCGTACCGTACCCTATACTACTACTTGACCAGGGGATGCTGCCGCAACCATTAC 
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FIG 54B 

661 giMM 

ThrValSerGlyPheValSerLeuLeuAlaPro61yAlaLys61nAsnVal6lnLeuIle 

ACTGTGTCTGGATTTGTTAGCCTC^ 
TGACACAGACCTAAACAATCGG^ 

""Wmmmmmmmm 

GlyValGluHi sArgLeuG luAlaAl aCys AsnTrpThr ArgG lyG JuArgCysAspLeu 

GluAspArgAspArgSerGluLeuSerProLeuLeuLeuThrThrThrGl^ 
1^1 | f 



1Z01 

1261 

1321 

1381 

CC 

1111 



^^pn^^^SerPheThrThrLeuProAlaLeuSerThrGlyLeuI leHisLeuHisGln 

17.1 aTc^M 

TTGTAACACCT6CACGTCATGAAQ 

LysTrpGluTyrValValLeuLeuPheLeuLeuLeuAlaAspAlaArgValCysSerCys 

1801 AAGTGGGAGTACGTCGTTCTCCTGTTCCTTCTGCTTGCAGACGCGCGCGTCTGCTCCTGC 
1801 ^bTbfabAfalAtblLbMUtu 

LeuTr pMetMetLeuLeu 1 1 eSe rG 1 nA 1 aG 1 u A 1 a A 1 aLeuG 1 uAsnleuVa 1 1 1 eLeu 
1861 TTGTGGATGATGCTACTCATATCCCAAGCGGAGG^ 

AACACCTACTACGATGAGTATAGGGTTCGCCTCCGCCGAAACCTCTTGGAGCATTATGAA 

AenA 1 aA 1 aSer LeuA laG 1 yTh rH i sG 1 yLeuVa 1 SerPheLeuVa 1 PhePheCysPhe 

1921 AATGCAGCATCCCTGGCCGGGAC^ 

TTACGTCGTAGGGACCGGCCCTGCGTGCCAGAACATAGGAAGGAGCACAAGAAGACGAAA 

AlaTrDTyrLeuLysGlyLysTrpValProGlyAlaValTyrThrPheTyrGlyMetTrp 

1981 GCATGGTATTTGAAGGGTAAGTG 

CGTACCATAAACTTCCCATTCAC.CCACGGGCCTCGCCAGATGTGGAAGATGCCCTACACC 

Prnl pul puLeuLeuLeuLeuAlaLeuProGlnArgAlaTyrAlaLeuAspThrGluVal 
2041 CCTcfccfcCTGCTCCTGTTGGCGTTG^ 
GGAGAGGAG6ACGAGGACM 

AlaAlaSerCysGlyGlyValValLeuValGlyLeuMetAlaLeuThrLeuSerProTyr 

2101 GCCGCGTCGTGTGGCGGTGTTGTT 

CGGCGCAGCACACCGCCACAACAAGAGCAGCCCAACTACCGCGACTGAGACAGTGGTATA 

TyrLysArgTyrlleSerTrpCysLeuTrpTrpLeuGlnTyrPheLeuThrArgValGlu 

2161 WMimmmmmLmmiffim 

AlaGlnL euHisValTrDlleProProLeuAsnValArgGlyGlyArgAspAlaVall le 
LeuLeuMe tCvs A 1 aVa 1 H i sProTh rLeuVa 1 PheAsp 1 1 eThrLysLeuLeuLeuA 1 a 

CAGAAGCCTGGGGAAACCTAAGAAGTTCGGTCAAACGAATTTCATGGGATGAAACACGCG 
ValGlnG 1 vLeuLeuA rgPheCysA 1 aLeuA laArgLy sMet I leG 1 yG 1 yH i sTy rVa 1 

21101 WMnslmmmmmmm& 

GTTTACCAGTAGTAATTCAATCCCCGCGAATGACCGTGGATACAAATATT6GTAGAGTGA 

ProLeuArgAspTrpA 1 aH i sAsnG 1 y^\}^9^Q^^i^^\^l^rT\rl i 
2521 CCTCTTCGGGACTGGGCGCACAACCGCTTGCGAGATCTGGCCGTGGCTGTAGA^ 

G6AGAAGCCCTGACCCGCGTGTTGCCGAACGCTCTAGACCGGCACCGACATCTCGGTCAG 

ValPheSerGlnMetGluThrLysLeuIleThrTrpGlyAlaAspThr^ 

2581 GTCTTCTCCCAAATGGAGACCAAGCTCATCACGTGGGGGG^ 

CAGAAGAGGGTTTACCTCTGGTTCGAGTAGTGCACCCCCCGTCTATGGCGGCGCACGCCA 

AspIlelleAsnGlyLeuProValSerAlaArgArgGlyArgGluIle^ 

26,1 mwmmimmmmmmm^ 



FIG. 54D 

A 1 a AspG 1 yMe tVa 1 Se r Ly sG 1 yTr pArgLeuLeuA 1 aPro 1 1 eTh rA 1 aTy r A 1 aG 1 n 
2701 GCCGATGGAATGGTCTCCAAGGGGTGGAG6TTGCTGGCGCCCATCACGGCGTACGCCCAG 
CGGCTACCTTACCAGAGGTTCCCCACCTCCAACGACCGCGGGTAGTGCCGCATGCGGGTC 

GlnThrArgGlyLeuLeuGlyCysIlelleThrSerLeuThrGlyArgAspLysAsnGln 
2761 CAGACAAGGGGCCTCCTAGGGTGCATAATCACCAGCCTAACTGGCCGGGACAAAAACCAA 
GTCTGTTCCCCGGAGGATCCCACGTATTAGTGGTCGGATTGACCGGCCCTGTTTTTGGTT 

ValGluGlyGluValGlnlleValSerThrAlaAlaGlnThrPheLeuAlaThrCysIle 
2821 GTGGAGGGTGAGGTCCAGATTGTGTCAACTGCTGCCCAAACCTTCCTGGCAACGTGCATC 
CACCTCCCACTCCAGGTCTAACACAGTTGACGACG6GTTTGGAAGGACCGTTGCACGTAG 

AsnGlyValCysTrpThrValTyrHisGlyAlaGlyThrArgThrlleAlaSerProLys 
2881 AATGGGGTGTGCTGGACTGTCTACCACGGGGCCGGAACGAGGACCATCGCGTCACCCAAG 
TTACCCCACACGACCTGACAGATGGTGCCCCGGCCTTGCTCCTGGTAGCGCAGTGGGTTC 

G 1 y ProVa HelGlnMetTy rTh rAsnVa 1 AspG 1 nAspLeuVa 1 G 1 yTrpPro A 1 aPro 
2941 GGTCCTGTCATCCAGATGTATACCAATGTAGACCAAGACCTTGTGGGCTGGCCCGCTCCG 
CCAGGACAGTAGGTCTACATATGGTTACATCTGGTTCTGGAACACCCGACCGGGCGAGGC 

GlnGlySerArgSerLeuThrProCysThrCysGlySerSerAspLeuTyrLeuValThr 

3001 caaggtagccgCtcattgacaccctgcacttgcggctcctcggacctttacctggtcacg 
gttccatcggcgagtaactgtgggacgtgaacgccgaggagcctggaaatggaccagtgc 

ArgHisAlaAspVallleProValArgArgArgGlyAspSerArgGlySerLeuLeuSer 
3061 AGGCACGCCGATGTCATTCCCGTGCGCCGGCGGGGTGATAGCAGGGGCAGCCTGCTGTCG 
TCCGTGCGGCTACAGTAAGGGCACGCGGCCGCCCCACTATCGTCCCCGTCGGACGACAGC 

ProArgProIleSerTyrLeuLysGlySerSerGlyGlyProLeuLeuCysProAlaGly 
3121 CCCCGGCCCATTTCCTACTTGAAAGGCTCCTCGGGGGGTCCGCTGTTGTGCCCCGCGGGG 
GGGGCCGGGTAAAGGATGAACTTTCCGAGGAGCCCCCCAGGCGACAACACGGGGCGCCCC 

HisAlaValGlyllePheArgAlaAlaValCysThrArgGlyValAlaLysAlaValAsp 
3181 CACGCCGTGGGCATATTTAGGGCCGCGGTGTGCACCCGTGGAGTGGCTAAG6CGGTGGAC 
GTGCGGCACCCGTATAAATCCCGGCGCCACACGTGGGCACCTCACCGATTCCGCCACCTG 

Phe 1 1 eProVa 1 G 1 uAsnLeuG 1 uTh rTh rMe t A rgSe rP roVa 1 PheThr AspAsnSer 
3241 TTTATCCCTGTGGAGAACCTAGAGACAACCATGAGGTCCCCGGTGTTCACGGATAACTCC 
AAATAGGGACACCTCTTGGATCTCTGTTGGTACTCCAGGGGCCACAAGTGCCTATTGAGG 

Se rProProVa 1 Va 1 ProG 1 nSe rPheG InValAlaHl sLeuH i sA 1 aProThrG 1 ySer 
3301 TCTCCACCAGTAGTGCCCCAGAGCTTCCAGGTGGCTGACCTCCATGCTCCCACAGGCAGC 
AGAGGTGGTCATCACGGGGTCTCGAAGGTCCACCGAGTGGAGGTACGAGGGTGTCCGTCG 

G 1 yLysSerThrLysVa 1 ProA 1 aA 1 aTy rA 1 aA 1 aG 1 nG lyTyrLysVa 1 LeuVa 1 Leu 
3361 GGCAAAAGCACCAAGGTCCCGGCTGCATATGCAGCTCAGGGCTATAAGGTGCTAGTACTC 
CCGTTTTCGTGGTTCCAGGGCCGACGTATACGTCGAGTCCCGATATTCCACGATCATGAG 

AsnProSerVa 1 A 1 aA 1 aTh r LeuG 1 yPheG 1 yA 1 aTy rM^ 
3421 AACCCCTCTGTTGCTGCAACACTGGGCTTTGGTGCTTACATGTCCAAGGCTCATGGGATC 
TTGGGGAGACAACGACGTTGTGACCCGAAACCACGAATGTACAGGTTCCGAGTACCCTAG 

AspProAsnlleArgThrGlyValArgThrlleThrThrGlyS^^ 
3481 GATCCTAACATCAGGACCGGGGTGAGAACAATTACCACTGGCAGCCCCATCACGTACTCC 
CTAGGATTGTAGTCCTGGCCCCACTCTTGTTAATGGTGACCGTCGGGGTAGTGCATGAGG 

Th rTy rG 1 y Ly s PheLeuA 1 aAspG 1 yG 1 yCysSe rGlyGlyAl aTy rA sp 1 1 e 1 1 e 1 1 e 
3541 ACCTACGGCAAGTTCCTTGCCGACGGCGGGTGCTCGGGGGGCGCTTATGACATAATAATT 
TGGATGCCGTTCAAGGAACGGCTGCCGCCCACGAGCCCCCCGCGAATACTGTATTATTAA 

CysAspGluCysHisSerThrAspAlaThrSerlleLeuGlylleGlyThrValLeuAsp 
3601 TGTGACGAGTGCCACTCCACGGATGCCACATCCATCTTGGGCATCGGCACTGTCCTTGAC 

ACACTGCTCACGGTGAGGTGCCTACGGTGTAGGTAGAACCCGTAGCCGTGACAGGAACTG -^ut^ 



FIG. 54E 

G 1 n A 1 aG 1 uThrA 1 aG lyAlaArgLeuVa 1 Va 1 LeuA 1 aTh r A 1 aThrProP roG 1 ySe r 
3661 CAAGCAGAGACTGCGGGGGCGAGACTGGTTGTGCTCGCCACCGCCACCCCTCCGGGCTCC 
GTTCGTCTCTGACGCCCCCGCTCTGACCAACACGAGCGGTGGCGGTGGGGAGGCCCGAGG 

ValThrValProHisProAsnlleGluGluValAlaLeuSerThrThrGlyGluIlePro 
3721 GTCACT6TGCCCCATCCCAACATCGAGGAGGTTGCTCTGTCCACCACCGGAGAGATCCCT 
CAGTGACACGGGGTAGGGTTGTAGCTCCTCCAACGAGACAGGTGGTGGCCTCTCTAGGGA 

PheTyrGlyLysAlalleProLeuGluVallleLysGlyGlyArgHisLeuIlePheCys 
3781 TTTTACGGCAAGGCTATCCCCCTCGAAGTAATCAAGGGGGGGAGACATCTCATCTTCTGT 
AAAATGCCGTTCCGATAGGGGGAGCTTCATTAGTTCCCCCCCTCTGTAGAGTAGAAGACA 

HisSerLysLysLysCysAspGluLeuAlaAlaLysLeuValAlaLeuGlylleAsnAla 
3841 CATTCAAAGAAGAAGTGCGACGAACTCGCCGCAAAGCTGGTCGCATTGGGCATCAATGCC 
GTAAGTTTCTTCTTCACGCTGCTTGAGCGGCGTTTCGACCAGCGTAACCCGTAGTTACGG 

Va 1 A 1 aTy rTy r A rgG 1 y LeuAspVa 1 Se rVa 1 1 1 eProTh rSe rG lyAspVa 1 Va 1 Va 1 
3901 GTGGCCTACTACCGCGGTCTTGACGT6TCCGTCATCCCGACCAGCGGCGATGTTGTCGTC 
CACCGGATGATGGCGCCAGAACTGCACAGGCAGTAGGGCTGGTCGCCGCTACAACAGCAG 

Va 1 A 1 aThr AspA 1 aLeuMetThrG 1 yTy rThrG 1 yAspPheAspSerVa 1 1 1 eAspCys 
3961 GTGGCAACCGATGCCCTCATGACCGGCTATACCG6CGACTTCGACTCGGTGATAGACTGC 
CACCGTTGGCTACGGGAGTACTGGeCGATATGGCCGCTGAAGCTGAGCCACTATCTGACG 

AsnThrCysValThrGlnThrValAspPheSerLeuAspProThrPheThrlleGluThr 
4021 AATACGTGTGTCACCCAGACAGTCGATTTCAGCCTTGACCCTACCTTCACCATTGAGACA 
TTATGCACACAGTGGGTCTGTCAGCTAAAGTCGGAACTGGGATGGAAGTGGTAACTCTGT 

IleThrLeuProGlnAspAlaValSerArgThrGlnArgArgGlyArgThrGlyArgG^ 
4081 ATCACGCTCCCCCAGGATGCTGTCTCCCGCACTCAACGTCGGGGCAGGACTGGCAGGGGG 
TAGTGCGAGGGGGTCCTACGACAGAGGGCGTGAGTTGCAGCCCCGTCCTGACCGTCCCCC 

Ly s ProG 1 y 1 1 eTy rArgPheVa 1 A 1 aP roG 1 yG 1 u A r gP roSe rG 1 yMe tPheAspSe r 
4141 AAGCCAGGCATCTACAGATTTGTGGCACCGGGGGAGCGCCC^ 

TTCGGTCCGTAGATGTCTAAACACCGTGGCCCCCTCGCGGGGAGGCCGTACAAGCTGAGC 

Se rVa 1 LeuCysG 1 uCysTy rAspA 1 aG 1 yCysAl aTrpTy rG 1 uLeuThrProA 1 aG 1 u 
4201 TCCGTCCTCTGTGAGTGCTATGACGCAGGCTGTGCTTGGTATGAGCTCACGCCCGCCGAG 
AGGCAGGAGACACTCACGATACTGCGTCCGACACGAACCATACTCGAGTGCGGGCGGCTC 

ThrThrValArgLeuArgAlaTyrMetAsnThrProGlyLeuProValCysGlnAspHis 
4261 ACTACAGTTAGGCTACGAGCGTACATGAACACCCCGGGGCTTCCCGTGTGCCAGGA^ 

TGATGTCAATCCGATGCTCGCATGTACTTGTGGGGCCCCGAAGGGCACACGGTCCTGGTA 

LeuG 1 uPheTrpG 1 uG 1 y Va 1 PheThrG 1 yLeuTh rH i s 1 1 eAspA 1 aH i sPheLeuSe r 
4321 CTTGAATTTTGGGAGGGCGTCTTTACAGGCCTCACTCATATAGATGCCCACTTTCT^ 

GAACTTAAAACCCTCCCGCAGAAATGTCCGGAGTGAGTATATCTACGGGTGAAAGATAGG 

G 1 nTh r LysG 1 nSe rG 1 yG 1 u AsnLeuProTy rLeuVa 1 A 1 aTy rG 1 nA 1 aTh rVa 1 Cys 
4381 CAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACCAAGCCACCGT^ 

GTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGCATGGTTCGGTGGCACACG 

AlaArgAlaGlnAlaProProProSerTrpAspGlnMetTrpLvsCysL^ 
4441 GCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGTGTTTGATTCGCCTC 
CGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCACAAACTAAGCGGAG 

LysProThrLeuH i sG 1 yProThrProLeuLeuTy rArgLeuG 1 yAlaValGlnAsnG 1 u 
4501 AAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTGGGCGCTGn 

TTCGGGTGGGA6GTACCCGGTTGTGGGGACGATATGTCTGACCCGCGACAAGTCTTACTT 

IleThrLeuThrHisProValThrLysTyrlleMetThrCysMetSerAlaAspL^^^ 
4561 ATCACCCTGACGCACCCAGTCACCAAATACATCATGACATGCATGTCGGCCGACCTGGAG 
TAGTGGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACGTACAGCCGGCTGGACCTC 

Va 1 Va 1 Th rSe rThrTr pVa 1 LeuVa 1G 1 yG lyVa lLeuAl aA 1 aLeuA 1 aA 1 aTv rCy s 
4621 GTCGTCACGAGaCCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTTTGGCWCGTATTGC 
CAGCAGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGACGAAACCGGCGCATAACG 
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Le uSe rTh rG 1 y Cys Va 1 Va 1 1 1 eVa 1 6 1 y A rgVa 1 Va 1 LeuSe rG 1 y LysP r o A 1 a 1 1 e 
4681 CTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTGTCCGGGAAGCCGGCAATC 
GACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAACAGGCCCTTCGGCCGTTAG 

HeProAspArgGluValLeuTyrArgGluPheAspGluMetGluGluCysSerGlnHis 
4741 ATACCTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGATGGAAGAGTGCTCTCAGCAC 
TATGGACTGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTACCTTCTCACGAGAGTCGTG 

LeuProTyrlleGluGlnGlyMetMetLeuAlaGluGlnPheLysGlnLysAlaLeuGly 
4801 TTACCGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAAGCAGAAGGCCCTCGGC 
AATG6CATGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAGTTCGTCTTCCGGGAGCCG 

LeuLeuGlnThrAlaSerArgGlnAlaGluVallleAlaProAlaValGlnThrAspTrp 
4861 CTCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTGTCCAGACCAACTGG 
GAGGACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGACGACAGGTCTGGTTGACC 

G InLysLeuG 1 uTh rPheTr pA 1 aLysH i sMe tTrpAsnPhe 1 1 eSe rG 1 y 1 1 eG 1 nTy r 
4921 CAAAAACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCAGTGGGATACAATAC 
GTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGTCACCCTATGTTATG 

LeuAlaGlyLeuSerThrLeuProGlyAsnProAlal leAlaSerLeuMetAlaPheThr 
4981 TTGGCGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGCTTCATTGATGGCTTTTACA 
AACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTAACTACCGAAAATGT 

AlaAlaValThrSerProLeuThrThrSerGlnThrLeuLeuPheAsnlleLeuGlyGly 
5041 GCTGCTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCTTCAACATATTGGGGGGG 
CGACGACAGTGGTCGGGTGATTGGTGATCGGTTTGGGAGGAGAAGTTGTATAACCCCCCC 

TrpValAlaAlaGlnLeuAlaAlaProGlyAlaAlaThrAlaPheValGlyAlaGlyLeu 
5101 TGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGTGGGCGCTGGCTTA 
ACCCACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAACACCCGCGACCGAAT 

AlaGlyAlaAlalleGlySerValGlyLeuGlyLysValLeuIleAspIleLeuAlaGly 
5161 GCTGGCGCCGCCATCGGCAGTGTTG6ACTGGGGAAGGTCCTCATAGACATCCTTGCAGGG 
CGACCGCGGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATCTGTAGGAACGTCCC 

Ty rG 1 yA laG lyVa 1 A 1 aG 1 yA 1 aLeuVa 1 A 1 aPheLys 1 1 eMetSerG 1 yG 1 uVa 1 Pro 
5221 TATGGCGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAGATCATGAGCGGTGAGGTCCCC 
ATACCGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACTCGCCACTCCAGGGG 

SerThrGluAspLeuValAsnLeuLeuProAlalleLeuSerProGlyAlaLeuValVal 
5281 TCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCGGAGCCCTCGTAGTC 
AGGTGCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGCCTCGGGAGCATCAG 

GlyValValCysAlaAlalleLeuArgARgHisValGlyProGlyGluGlyAlaValGln 
5341 GGCGTGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCGAGGGGGCAGTGCAG 
CCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGCTCCCCCGTCACGTC 

TrpMetAsnArgLeuIleAlaPheAlaSerArgGlyAsnHisValSerProThrHisTyr 
5401 TGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCCCACGCACTAC 
ACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGGGTGCGTGATG 

Va 1 ProG luSer AspA 1 aA 1 aA 1 a A rgVa 1 Th r A 1 a 1 1 eLeuSerSe r 
5461 GTGCCGGAGAGCGATGCAGCTGCCCGCGTCACTGCCATACTCAGCAGCCTCACTGTAACC 
CACGGCCTCTCGCTACGTCGACGGGCGCAGTGACGGTATGAGTCGTCGGAGTGACATT6G 

G 1 nLeuLeu A rgA rgLeuH i sG 1 nTrp 1 1 eSerSerG 1 uCysTh rTh rProCy sSe rG 1 y 
5521 CAGCTCCTGAGGCGACTGCACCAGTGGATAAGCTCGGAGTGTACC^ 

GTCGAGGACTCCGCTGACGTGGTCACCTATTCGAGCCTCACATGGTGAGGTACGAGGCCA 

SerTrpLeuArgAspI leTrpAspTrplleCysGluValLeuSerAspPheLysThrTrp 
5581 TCCTGGCTAAGGGACATCTGGGACTGGATATGCGAGGTGTTGAGCGACTTTAAGACCTGG 
AGGACCGATTCCCTGTAGACCCTGACCTATACGCTCCACAACTCGCTGAAATTCTGGACC 

LeuLysAlaLysLeuMetProGlnLeuProGlylleProPheValSerCysGlnArgGly 
5641 CTAAAAGCTAAGCTCATGCCACAGCTGCCTGGGATCCCCTTTGTGTCCTGCCAGCGCGGG 
GATTTTCGATTCGAGTACGGTGTCGACGGACCCTAGGGGAAACACAGGACGGTCGCGCCC 



, jigsaws 
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Ty rLy sG ly Va lTrpA rgVa 1 Asp6 1 y I leMetH 1 sTh rA rgCysH i sCy sG 1 yA 1 aG 1 u 
5701 TATAAGGGGGTCTGGCGAGTGGACGGCATCATGCACACTCGCTGCCACTGTGGAGCTGAG 
ATATTCCCCCAGACCGCTCACCTGCCGTAGTACGTGTGAGCGACGGTGACACCTCGACTC 

IleThrGlyHisValLysAsnGlyThrMetArglleValGlyPRoArgThrCysArgAsn 
5761 ATCACTGGACATGTCAAAAACGGGACGATGAGGATCGTCGGTCCTAGGACCTGCAGGAAC 
TAGTGACCTGTACAGTTTTTGCCCTGCTACTCCTAGCAGCCAGGATCCTGGACGTCCTTG 

MetTrpSerGlyThrPheProIleAsnAlaTyrThrThrGlyProCysThrProLeuPro 
5821 ATGTGGAGTGGGACCTTCCCCATTAATGCCTACACCACGGGCCCCTGTACCCCCCTTCCT 
TACACCTCACCCTGGAAGGGGTAATTACGGATGTGGTGCCCGGGGACATGGGGGGAAGGA 

AlaProAsnTyrThrPheAlaLeuTrpArgValSerAlaGluGluTyrValGluIleArg 
5881 GCGCCGAACTACACGTTCGCGCTATGGAGGGTGTCTGCAGAGGAATATGTGGAGATAAGG 
CGCGGCTTGATGTGCAAGCGCGATACCTCCCACAGACGTCTCCTTATACACCTCTATTCC 

G 1 nVa 1 G 1 yAspPheH i sTy rVa ITh rG 1 yMetTh rTh r AspAsnLeuLy sCysProCys 
5941 CAGGTGGGGGACTTCCACTACGTGACGGGTATGACTACTGACAATCTCAAATGCCCGTGC 
GTCCACCCCCTGAAGGTGATGCACTGCCCATACTGATGACTGTTAGAGTTTACGGGCACG 

G 1 nVa 1 ProSerP roG 1 uPhePheTh rG 1 uLeuAspG 1 y Va 1 A rgLeuH i s A rgPheA 1 a 
6001 CAGGTCCCATCGCCCGAATTTTTCACAGAATTGGACGGGGTGCGCCTACATAGGTTTGCG 
GTCCAGGGTAGCGGGCTTAAAAAGTGTCTTAACCTGCCCCACGCGGATGTATCCAAACGC 

ProProCysLysProLeuLeuA rgG 1 uGl uVa 1 Se rPheA rgVa 1 G 1 yLeuH i sG 1 uTy r 
6061 CCCCCCTGCAAGCCCTTGCTGCGGGAGGAGGTATCATTCAGAGTAGGACTCCACGAATAC 
GGGGGGACGTTCGGGAACGACGCCCTCCTCCATAGTAAGTCTCATCCTGAGGTGCTTATG 

ProVa 1 G lySerG 1 nLeuProCysG 1 uProG 1 uP roAspVa 1 A 1 aVa 1 LeuTh rSe rMet 
6121 CCGGTAGGGTCGCAATTACCTTGCGAGCCCGAACCGGACGTGGCCGTGTTGACGTCCATG 
GGCCATCCCAGCGTTAATGGAACGCTCGGGCTTGGCCTGCACCGGCACAACTGCAGGTAC 

LeuTh rAspP roSe rH i s 1 1 eTh rA 1 aG 1 u A 1 aA 1 aG 1 y A rg A rgLeuA 1 a A rgG 1 ySe r 
6181 CTCACTGATCCCTCCCATATAACAGCAGAGGCGGCCGGGCGAAGGTTGGCGAGGGGATCA 
GAGTGACTAGGGAGGGTATATTGTCGTCTCCGCCGGCCCGCTTCCAACCGCTCCCCTAGT 

ProProSerValAlaSerSerSerAlaSerGlnLeuSerAlaProSerLeuLysAlaThr 
6241 CCCCCCTCTGTGGCCAGCTCCTCGGCTAGCCAGCTATCCGCTCCATCTCTCAAGGCAACT 
GGGGGGAGACACCGGTCGAGGAGCCGATCGGTCGATAGGCGAGGTAGAGAGTTCCGTTGA 

CysThrAlaAsnHisAspSerProAspAlaGluLeuIleGluAlaAsnLeuLeuTrpArg 
6301 TGCACCGCTAACCATGACTCCCCTGATGCTGAGCTCATAGAGGCCAACCTCCTATGGAGG 
ACGTGGCGATTGGTACTGAGGGGACTACGACTCGAGTATCTCCGGTTGGAGGATACCTCC 

61 nG 1 uMetG 1 yG 1 yAsn I leThrA rgVa 1 G luSerG 1 uAsnLysVa 1 Va 1 1 leLeuAsp 
6361 CAGGAGATGGGCGGCAACATCACCAGGGTTGAGTCAGAAAACAAAGTGGTGATTCTGGA 
GTCCTCTACCCGCCGTTGTAGTGGTCCCAACTCAGTCTTTTGTTTCACCACTAAGACCTG 

SerPheAspProLeuValAlaGluGluAspGluArgGluIleSerValProAlaGluIle 
6421 TCCTTCGATCCGCTTGTGGCGGAGGAGGACGAGCGGGAGATCTCCGTACCCGCAGAM 
AGGAAGCTAGGCGAACACCGCCTCCTCCTGCTCGCCCTCTAGAGGCATGGGCGTCTTTAG 

LeuArgLysSerArgArgPheAlaGlnAlaLeuProValTrpMaArgProAspTyrA^ 
6481 CTGCGGAAGTCTCGGAGATTCGCCCAGGCCCTGCCCGTTTGGGCGCGGCCGGACTATAAC 
GACGCCTTCAGAGCCTCTAAGCGGGTCCGGGACGGGCAAACCCGCGCCGGCCTGATATTG 

ProProLeuValGluThrTrpLysLysProAspTyrGluProProValValHisGlyCys 
CCCCCGCTAGTGGA6AC6TGGAAAAAGCCCGACTACGAACCACCTGTGGTCCATGGGTGT 
GGGGGC6ATCACCTCTGCACCTTTTTCGGGCTGATGCTTGGTGGACACCAGGTACCGACA 

ProLeuProProProLysSerProProValProProProArgLysLysArgThrValVal 
6601 CCGCTTCCACCTCCAAAGTCCCCTCCTGTGCCTCCGCGTCGGAAGAAGCGGACGGTGGTC 
GGCGAAGGTGGAGGTTTCAGGGGAGGACACGGAGGCGGAGCCTTCTTCGCCTGCCACCAG 

LeuTh rG 1 uSe rTh r LeuSe rTh r A 1 aLeu A 1 aG 1 uLeuA 1 aTh rA rgSe rPheG 1 ySe r 
6661 CTCACTGAATCAACCCTATCTACTGCCTTGGCCGAGCTCGCCACCAGAAGCTTTGGCAGC 
GAGTGACTTAGTTGGGATAGATGACGGAACCGGCTCGAGCGGTGGTCTTCGAAACCGTCG 



6541 
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Se r Se rTh rSerG ly 1 1 eTh rG 1 yAspAsnTh rTh rTh rSe rSe rG 1 uP ro A 1 aProSe r 
6721 TCCTCAACTTCCGGCATTACGGGCGACAATACGACAACATCCTCTGAGCCCGCCCCT^ 
AGGAGTTGAAGGCCGTAATGCCCGCTGTTATGCTGTTGTAGGAGACTCGGGCGGGGAAGA 

GlyCysProProAspSerAspAlaGluSerTyrSerSerMetProProLeuGluGlyGlu 
6781 GGCTGCCCCCCCGACTCCGACGCTGAGTCCTATTCCTCCATGCCCCCCCTGGAGGGGGAG 
CCGACGGGGGGGCTGAGGCTGCGACTCAGGATAAGGAGGTACGGGGGGGACCTCCCCCTC 

6mi EE¥aiS8HK^i8^ 

GGACCCCTAGGCCTAGAATCGCTGCCCAGTACCAGTTGCCAGTCATCACTCCGGTTGCGC 

G 1 uAspVa 1 va 1 CysCysSe rMe tSe rTy rSerTrpTh rG 1 yA 1 aLeuVa 1 ThrProCys 
6901 GAGGATGTCGTGTGCTGCTCAATGTCTTACTCTTGGACAGGCGCACTCGTCACCCW 

CTCCTACAGCACACGACGAGTTACAGAATGAGAACCTGTCCGCGTGAGCAGTGGGGCACG 

AlaAlaGluGluGlnLysLeuProIleAsnAlaLeuSerAsnSerLeuLeuArgHis^ 
6961 GCCGCGGAAGAACAGAAACTGCCCATCAATGCACTAAGCAACTCGTTGCTACGTCACCAC 
CGGCGCCTTCTTGTCTTTGACGGGTAGTTACGTGATTCGTTGAGCAACGATGCAGTGGTG 

AsnLeuValTyrSerThrThrSerArgSerAlaCysGlnArgGlnU^ 
7021 AATTTGGTGTATTCCACCACCTCACGCAGTGCTTGCCAAAGGCAGAAGAAAGTCACATTT 
TTAAACCACATAAGGTGGTGGAGTGCGTCACGAACGGTTTCCGTCTTCTTTCAGTGTAAA 

AspARgLeuG 1 nVa 1 LeuAspSe r H i sTy rG 1 nAspVa 1 LeuLysG 1 uVal LysA 1 aA 1 a 
7081 GACAGACTGCAAGTTCTGGACAGCCATTACCAGGACGTACTCAAGGAGGTTAM 

CTGTCTGACGTTCAAGACCTGTCGGTAATGGTCCTGCATGAGTTCCTCCAATTTCGTCGC 

AlaSerLysValLysAlaAsnLeuLeuSerValGluGluAlaCysSerLeuTh^ 
7141 GCGTCAAAAGTGAAGGCTAACTTGCTATCCGTAGAGGAAGCTTGCAGCCTGACGCCCCCA 
CGCAGTTTTCACTTCCGATTGAACGATAGGCATCTCCTTCGAACGTCGGACTGCGGGGGT 

H i sSe r A laLysSerLysPheG 1 yTy rG 1 yA laLysAspVa 1 A rgCysH i sA laArgLys 
7201 CACTCAGCCAAATCCAAGTTTGGTTATGGGGCAAAAGACGTCCGTT^ 

GTGAGTCGGTTTAGGTTCAAACCAATACCCCGTTTTCTGCAGGCAACGGTACGGTCTTTC 

AlaValThrHisIleAsnSerValTrpLysAspLeuLeuGluAspAsnValTh^ 
7261 GCCGTAACCCACATCAACTCCGTGTGGAAAGACCTTCTGGAAGACAATGTAACACCAATA 
CGGCATTGGGTGTAGTTGAGGCACACCTTTCTGGAAGACCTTCTGTTACATTGTGGTTAT 

AspTh rTh r 1 1 eMetA laLysAsnG 1 uVa 1 PheCysVa 1 G i nProG 1 uLysG 1 yG lyA rg 
7321 GACACTACCATCATGGCTAAGAACGAGGTTTTCTGCGTTC^ 

CTGTGATGGTAGTACCGATTCTTGCTCCAAAAGACGCAAGTCGGACTCTTCCCCCCAGCA 

LysProAlaArgLeuIleValPheProAspLeuGlyyalArgValCysGlu^ 
7381 AAGCCAGCTCGTCTCATCGTGTTCCCCGATCTGGGCGTGCGCGTGTGCGAAAAGAT 

TTCGGTCGAGCAGAGTAGCACAAGGGGCTAGACCCGCACGCGCACACGCTTTTCTACCGA 

LeuTyrAspValValThrLysLeuProLeuAlaValMetGlySerSe^yrGlyPheGln 

7M\ TTGTACGACGTGGTTACAAAGCTCCCCTTGGCCGTGATGGGM 

AACATGCTGCACCAATGTTTCGAGGGGAACCGGCACTACCCTTCGAGGATGCCTAAGGTT 

Ty rSerProG lyG InA rgVa 1 G 1 uPheLeuVa 1 G 1 nA 1 aTrpLysSerLysLysThrPro 
7501 TACTCACCAGGACAGCGGGTTGAATTCCTCGTGCAAGCGTGGM 

ATGAGTGGTCCTGTCGCCCAACTTAAGGAGCACGTTCGCACCTTCAGGTTCTTTTGGGGT 

Me tG 1 y PheSe rTy r AspTh r A rgCy sPheA spSerTh rVa 1 Th rG 1 uSerAspI leArg 
7561 ATGGGGTTCTCGTATGATACCCGCTGCTTTGACTCCACAGTCACTGAGAGCGACA^ 

TACCCCAAGAGCATACTATGGGCGACGAAACTGAGGTGTCAGTGACTCTCGCTGTAGGCA 

ThrGluGluAlalleTyrGlnCysCysAspLeuAspProGlnA^ 
7621 ACGGAGGAGGCAATCTACCAATGTTGTGACCTCGACCCCCAAGCCCGC^ 

TGCCTCCTCCGTTAGATGGTTACAACACTGGAGCTGGGGGTTCGGGCGCACCGGTAGTTC 

Se rLe uTh rG 1 uA rg LeuTy rVa 1 G 1 yG 1 y P roLeuTh r AsnSe r A rgG 1 yG 1 uAsnCys 
7681 TCCCTCACCGAGAGGCTTTATGTTGGGGGCCCTCTTACCAAT^ 

AGGGAGTGGCTCTCCGAAATACAACCCCCGGGAGAATGGTTAAGTTCCCCCCTCTTGACG 
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G 1 yTy rA rgA rgCysA rgA 1 aSe rG 1 y Va 1 LeuTh rThrSe rCysG 1 yAsnTh rLeuTh r 
7741 GGCTATCGCAGGTGCCGCGCGAGCGGCGTACTGACAACTAGCTGTGGTAACACCCTCACT 
CCGATAGCGTCCACGGCGCGCTCGCCGCATGACTGTTGATCGACACCATTGTGGGAGTGA 

CysTyrlleLysAlaArgAlaAlaCysArgAlaAlaGlyLeuGlnAspCysThrMetLeu 
7801 TGCTACATCAAGGCCCGGGCAGCCTGTCGAGCCGCAGGGCTCCAGGACTGCACCATGCTC 
ACGATGTAGTTCCGGGCCCGTCGGACAGCTCGGCGTCCCGAGGTCCTGACGTGGTACGAG 

_ ValCysGlyAspAspLeuValVallleCysGluSerAlaGlyValGlnGluAspAlaAla 
7861 GTGTGTGGCGACGACTTAGTCGTTATCTGTGAAAGCGCGGGGGTCCAGGAGGACGCGGCG 
CACACACCGCTGCTGAATCAGCAATAGACACTTTCGCGCCCCCAGGTCCTCCTGCGCCGC 

SerLeuArgAlaPheThrGluAlaMetThrArgTyrSerAlaProProGlyAspProPro 
7921 AGCCTGAGAGCCTTCACGGAGGCTATGACCAGGTACTCCGCCCCCCCTGGGGACCCCCCA 
TCGGACTCTCGGAAGTGCCTCCGATACTGGTCCATGAGGCGGGGGGGACCCCTGGGGGGT 

G 1 nProG 1 uTy r AspLeuG 1 uLeu 1 1 eTh rSe rCy sSe rSe rAsnVa 1 Se rVa 1 A 1 aH i s 
7981 CAACCAGAATACGACTTGGAGCTCATAACATCATGCTCCTCCAACGTGTCAGTCGCCCAC 
GTTGGTCTTATGCTGAACCTCGAGTATTGTAGTACGAGGAGGTTGCACAGTCAGCGGGTG 

AspGlyAlaGlyLysArgValTyrTyrLeuThrArgAspProThrThrProLeuAlaArg 
8041 GACGGCGCTGGAAAGAGGGTCTACTACCTCACCCGTGACCCTACAACCCCCCTCGCGAGA 
CTGCCGCGACCTTTCTCCCAGATGATGGAGTGGGCACTGGGATGTTGGGGGGAGCGCTCT 

AlaAlaTrpGluThrAlaArgHisThrProValAsnSerTrpLeuGlyAsnllelleMet 
8101 GCTGCGTGGGAGACAGCAAGACACACTCCAGTCAATTCCTGGCTAGGCAACATAATCATG 
CGACGCACCCTCTGTCGTTCTGTGTGAGGTCAGTTAAGGACCGATCCGTTGTATTAGTAC 

PheA laProThrLeuTrpA 1 aA rgMetl 1 eLeuMe tTh rH i sPhePheSerVa 1 Leu 1 1 e 
8161 TTTGCCCCCACACTGTGGGCGAGGATGATACTGATGACCCATTTCTTTAGCGTCCTTATA 
AAACGGGGGTGTGACACCCGCTCCTACTATGACTACTGGGTAAAGAAATCGCAGGAATAT 

AlaArgAspGlnLeuGluGlnAlaLeuAspCysGluI leTyrGlyAlaCysTyrSerl le 
8221 GCCAGGGACCAGCTTGAACAGGCCCTCGATTGCGAGATCTACGGGGCCTGCTACTCCATA 
CGGTCCCTGGTCGAACTTGTCCGGGAGCTAACGCTCTAGATGCCCCGGACGATGAGGTAT 

GluProLeuAspLeuProProI lei leGlnArgLeu 
8281 GAACCACTTGATCTACCTCCAATCATTCAAAGACTC 
CTTGGTGAACTAGATGGAGGTTAGTAAGTTTCTGAG 
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FIG. 62A 

Q^fKCACCATGAATCACTCCCCTGTGAGGAACTACTGTCTTCACGCAGAAAGCGTCTAG 
CCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCCCCCCTCCCGGGAGAGCCATA 
§I§§IQIQQ§§^£QGGTGAGTACACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGA 
TCAACCCGCTCAATGCCTGGAGATTTGGGCGTGCCCCCGCAAGACTGCTAGCCGAGTAGT 
GTTGGGTCGCGAAAGGCCTTGTGGTACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAG-300 

_ . — (Putative initiator methionine codon) 

GTCTCGTAGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAAAAAAAACAAACGTAA 
CACCAACCGTCGCCCACAGGAW 

JJACTTGTTGCCGCGCAGGGGCCCTAGATTGGGTGTGCGCGCGACGAGAAAGACTTCCGA 

rArrT^^T99I^^^I^§^^^^^^^TATCCCCAAGGCTCGTCGGCCCGAGGGCAG 

GACCTGGGCTCAGCCCGGGTACCCTTGGCCCCTCTATGGCAATGAGGGCTGCGGGTGGGC-600 

TAr^x^rJ9AT§lTl^£95I^^^^^^^^^^^^TGGGGCCCCACAGACCCCCGGCG 

I^§§I?§?§?^III55§I^^^^ATCGATACCCTTACGTGCGGCTTCGCCGACCTCAT 

GGGGTACATACCGCTCGTCGGCGCCCCTCTTGG 

p^TCCGGGTTCTGGAAGACGGCGTGAACTATGCAACAGGGAACCTTCCTGGTTGCTCTTT 
PSJ^T^J^JJ^^^T^T CTCTT6CTTGACTGT6CCCGCTTCGGC CTACCAAGT-900 

gcgcaactccacggggctttaccacgtcaccaatgattgccctaactcgagtattgtgta 
cgaggcggccgatgccatcctgcacactccggggtgcgtcccttgcgttcgtgagggcaa 
cgcctcgaggtgttgggtggcgatgacccctacggtgg 
cgcgacgcagcttcgacgtcaca 

cctaacgtgggggacctatgcgggtctgtctttcttgtcggccaactgttcaccttctc-1200 

rrrtr«^^«t^x^^ ga ^ ggaagg ttgcaattgctctatctatcccggccatataac 

gggtcaccgcatggcatgggatatgatgatgaactggtcccctacgacggcgttggtaat 

ggctcagctgctccggatcccacaagccatcttggacatgatcgctggtgctcactgggg 

a g tcctggcgggcatagcgtatttctccatggtggggaactgggcgaaggtcctggtagt 

g £tgctgctatttgccggc 

caagtgtctggatttgttagcctcctcgcaccaggcgccaagcagaacgtccagctgat 

caacaccaacggcagttggcacctcaatagcacggccctgaactgcaatgatagcctcaa 

caccggctggttggcagggcttttctatcaccacaagttcaactcttcaggctgtcctga 

gaggctagccagctgccgaccccttaccgattttgaccagggctggggccctatcagtta 

t g ccaacggaagcggccccgaccagcgcccctactgctggcac 

cggtattgtgcccgcgaagagtgtgtg^ 

gg j gg j ggg aacgaccgacaggtcgggcgcgcccacctacagctggggtgaaaatgatac 

^^^ipij9§i^it^ caataccaggccag cgctgggcaattggttcggttgtacctg 

^i^ aa ^km ctggattcaccaaag tgtgcggagcgcctccttgtgtcatcggaggggc 

ggg caacaacaccctgcactgccccactgattgcttccgcaagcatccggacgccacata-2100 

cjctcggtgcggctccggtccctggatcacacccaggtgcctggtcgactacccgtatag 

gcmggcattatcgtgtaccatcaactacaccatatttaaaatcaggatgtacgt 

^§^t^ g ^^ a ^ a ^^ gaagctgcctgcaa ctggacgcggggcgaacgttgcgatct 

ggaagacagggacaggtccga^ 

^k c ^i g i tccttcacaacc ctaccagccttgtccaccggcctcatccacctccacca-2400 

$ aa ^ii5t^ acgtgcagtact t g tacggggtggggtcaagcatcgcgtcctgggccat 

jaactgggagtacgtcgttctcctgttccttctgcttgcagacgcgcgcgtctgctcctg 

xIItJ^a^J9tI^I^^^ a ^ a ^ gg ^ aag ^ ggagg ^ gg ^ttggagaacctcgtaatact 

taatgcagcatccctggccgggacgcacggtcttgtatcctt 

tgcatggtatttga^ 

gccjacctcctgctcctgttggcgttgccccagcgggcgtacgcgctggacacggaggt 
ggccgcgtcgtgtggcggtgttgttctcgtcgggttgatggcgctgactctgtcaccata 

TJ^oa«^£I a I^I£ a ^^ gg ^ g ^^ g ^ gg ^ gg ^^^ ag TATTTTCTGACCAGAGTGGA 

agcgcaactgcac^gtggattccccccctcaacgtccga 

cjta^catgt^gctgtacacccgactctggtatttgacatcaccaaattgctgctggc-3000 

^kji9^^ tt ^ii tggattcttcaag ccagtttgcttaaagtaccctactttgtgcg 

cgtccaaggccttctccggttctgcgcgttagcgcggaagatgatcggaggccattacgt 

g caaatg^catcattaagttaggggcgcttactggcacctatgtttataaccatctcac 

jccjmcgggactgggcgcacaacggcttgcgagatctggccgtggctgtagagccagt 

tPJPTKItCcaaatggag^ 

j ga catcatcaacggcngcctgtttccgcccgcaggggccgggagatac 

^^^i^^i^kk caaggggtggaggttg ctggcgcccatcacggcgtacgccca 

gcagacaaggggcacaagggtgcataatcaccagcctaactggc 

AGTGGAGGGTGAGGTCCAGATTGTGTCAACTGCTGCCCAAACCTTCCTGGCAACGTGCAT 





FIG. 62B 



CAATGG6GTGTGCTG6ACTGTCTACCACG6GGCCGGAACGAG6ACCATCGCGTCACCCAA-3600 

GGGTCCTGTCATCCAGATGTATACCAATGTAGACCAAGACCTTGTGGGCTGGCCCGCTCC 

GCAAGGTAGCCGCTCATTGACACCCTGCACTTGCGGCTCCTCGGACCTTTACCTGGTCAC 

GAGGCACGCCGATGTCATTCCCGTGCGCCGGCGGGGTGATAGCAGGGGCAGCCTGCTGTC 

GCCCCGGCCCATTTCCTACTTGAAAGGCTCCTCGGGGGGTCCGCTGTTGTGCCCCGCGGG 

GCACGCCGTGGGCATATTTAGGGCCGCGGTGTGCACCCGTGGAGTGGCTAAGGCGGTGGA-3900 

CTTTATCCCTGTGGAGAACCTAGAGACAACCATGAGGTCCCCGGTGTTCACGGATAACTC 

CTCTCCACCAGTAGTGCCCCAGAGCTTCCAGGTGGCTCACCTCCATGCTCCCACAGGCAG 

CGGCAAAAGCACCAAGGTCCCGGCTGCATATGCAGCTCAGGGCTATAAGGTGCTAGTACT 

CAACCCCTCTGTTGCTGCAACACTGGGCTTTGGTGCTTACATGTCCAAGGCTCATGGGAT 

CGATCCTAACATCAGGACCGGGGTGAGAACAATTACCACTGGCAGCCCCATCACGTACTC-4200 

CACCTACGGCAAGTTCCTTGCCGACGGCGGGTGCTCGGGGGGCGCTTATGACATAATAAT 

TTGTGACGAGTGCCACTCCAGGGATGCCACATCCATCTTGGGCATCGGCACTGTCCTTGA 

CCAAGCAGAGACTGCGGGGGCGAGACTGGTTGTGCTCGCCACCGCCACCCCTCCGGGCTC 

CGTCACTGTGCCCCATCCCAACATCGAGGAGGTTGCTCTGTCCACCACCGGAGAGATCCC 

TTTTTACGGCAAGGCTATCCCCCTCGAAGTAATCAAGGGGGGGAGACATCTCATCTTCTG-4500 

TCATTCAAAGAAGAAGTGCGACGAACTCGCCGCAAAGCTGGTCGCATTGGGCATCAATGC 

CGTGGCCTACTACCGCGGTCTTGACGTGTCCGTCATCCCGACCAGCGGCGATGTTGTCGT 

CGTGGCAACCGATGCCCTCATGACCGGCTATACCGGCGACTTCGACTCGGTGATAGACTG 

CAATACGTGTGTCACCCAGACAGTCGATTTCAGCCTTGACGCTACCTTCACCATTGAGAC 

AATCACGCTCCCCCAGGATGCTGTCTCCCGCACTCAACGTCGGGGCAGGACTGGCAGGGG-1800 

GAAGCCAGGCATCTACAGATTTGTGGCACCGGGGGAGCGCCCCTCCGGCATGTTCGACTC 

GTCCGTCCTCTGTGAGTGCTATGACGCAGGCTGTGCTTGGTATGAGCTCACGCCCGCCGA 

GACTACAGTTAGGCTACGAGCGTACATGAACACCCCGGGGCTTCCCGTGTGCCAGGACCA 

TCTTGAATTTTGGGAGGGCGTCTTTACAGGCCTCACTCATATAGATGCCCACTTTCTATC 

CCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACCAAGCCACCGTGTG-5100 

CGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGTGTTTGATTCGCCT 

CAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTGGGCGCTGTTCAGAATGA 

AATCACCCTGACGCACCCAGTCACCAAATACATCATGACATGCATGTCGGCCGACCTGGA 

GGTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTTTGGCCGCGTATTG 

CCTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTGTCCGGGAAGCCGGCAAT-5400 

CATACCTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGATGGAAGAGTGCTCTCAGCA 

CTTACCGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAAGCAGAAGGCCCTCGG 

CCTCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTGTCCAGACCAACTG 

GCAAAAACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCAGTGGGATACAATA 

CTTGGCGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGCTTCATTGATGGCTTTTAC-5700 

AGCTGCTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCTTCAACATATTGGGGGG 

GTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGTGGGCGCTGGCTT 

AGCTGGCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAGACATCCTTGCAGG 

GTATGGCGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAGATCATGAGCGGTGAGGTCCC 

CTCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCGGAGCCCTCGTAGT-6000 

CGGCGTGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCGAGGGGGCAGTGCA 

GTGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCCCACGCACTA 

CGTGCCGGAGAGCGATGCAGCTGCCCGCGTCACTGCCATACTCAGCAGCCTCACTGTAAC 

CCAGCTCCTGAGGCGACTGCACCAGTGGATAAGCTCGGAGTGTACCACTCCATGCTCCGG 

TTCCTGGCTAAGGGACATCTGGGACTGGATATGCGAGGTGTTGAGCGACTTTAAGACCTG-6300 




FIG. 62C 

^T^^§QIM5CJCATGCCACAGCT6CCTGGGATCCCCTTTGTGTCCTGCCAGCGCG6 
^I^5^^KI^9 GAGTG6ACGGCATC ATGCACACTCGCTGCCACTGTGGAGCTGA 
GATCACTGGACATGTCAAAAAC^ 

TrJ^X^^«I$§5^KIK^^ A ^ AA ^ G ^^ACACCACGGGCCCCTGTACCCCCCTTCC 

TGC6CC6AACTACACGTTC^ 

GCAGGTGGGGGACTTCCACTACGTGACGGGTATGACT^ 

CCAGGTCCCATCGCCCGAATTTTTCACAGAATTGGACGGGGTGCGCCTACATAGGTTTGC 

GCCCCCgGCAAGCCCnGCTGCGGGAGGAGGTATCATTCAGAGTAGGA 

CCCGGTAGGGTC6CAATTACCm 

6CTCACTGATCCCTCCCATATAACAGCAGAGGW 

ACCCCCGTCTGTGGCCAGCTCCTCGG^ 

JJ^P^^^^^J§ A 9K^QI GATGCTGAG CTCATAGAGGCCAACCTCCTATGGAG 
GCAGGAGATGGGCGGCAACA^ 

CTCCTTCGATCCGCTTGTGGCGGAGGAGGACGAGCGGGAGATCTCCGTACCCGCAGAAAT 

CCTGCGGAAGTCTCGGAGATTCGCCCAGGCCCTGCCCGTTTGGGCGCGGCCGGACT 

CCCCCCGCTAGTGGAGACGTGGAAAAAGCC^ U 

TCCGCTTCCACCTCCAAAGTCCCCTCCTGTGCCTCCGCCTCGGAAGAAGCGGACGGTGGT 

CaCACTGAATCAACCCTATCTACTGCCTTGGCCGAGCTCGC^ 

CTCCTCAACnCCGGCATTACGGGCGACAATACGACAACATCCT 

JS&IS&SSCgACJttG^ 

GCCTGGGGATCCGGATCTTAGCGACGGGTCATGGTCAACGGTCAGTAGTGAGGCCAACGC 

rrrrrr J^I99TaJ9£T5^^ aa ^ g ^^tactcttggacaggcgcactcgtcaccccgtg 

CGCCGCGGAAGAACAGAAACTGCCCATCAATGCACTAAGCAACTCGTTGCTACGTCA^ 

GGCGTCAAAAGTGAAGGCTAACTTGCTATCCGTAGAGGAAGCTTGCAGCCTGACGCCCCC 
?^^ A ^^ A ^I^ AA ^ TTGGTTATGGGGCAAAAGA C G TCCGTTGCPTGCCAGAAA 
^9^J^ A ^9^ ATCAACTCCGTGTGGAAAGA CCTTCTGGAAGACAATGTAACACCAAT 
JJJSSSIACCATCATGG^ 

in^^^J G ^II^^ AGCTCCCCTTGGCCG T GA TGGGAAGCTCCTACGGATTCCA 
JKPISJSA^CAG^ 

MJ^^JI^J^ G T^I5 A I59 CCGCTGCTTTGACTCCA CAGTCACTGAGAGCGACATCCG 
TACGGAGGAGGCAATCTACCAATGTTGTGACCTCGACCCCCAAGCCCGCGTGGCCATCAA 
GTCCCTCACCGAGAGGCmAT 

JTGCTACATCAAGGCCCGGGCAGCCTGTCGA 

^^9J^ 6A $^IK^ GAGGCTATGACCAGG TACTCCGCCCCCCCTGGGGACCCCCC 
^ A ^^^MI^ GA £ TTGGAGCTGATAA CATCATGCTCCTCCAACGTGTCAGTCGCCCA-8700 
CGACGGCGCTGGAAAGAGGGTCTACTACCTCACCCGTGACCCTACAACCCCCCTCGCGAG 
AGCTGCGTGGGAGACAGCAA^ 

ffn^^9^^^I9^9^ AGGA TGATACTGATGACCCATTTCTTTAGCGTCCTTAT 

^ A ^^II^KI A ^K9 AATCAnCAAAGA CTCCATGGCCTCAGCGCATTTTCACT-9000 

CCACAGTTACTCTCCAGGTGAAATTAATAGGGTGGCCGCATGCCTCAGAAAACTTGGGGT 

ACCGfCCTTGCGAGCTTG^ 

AGGAGGCAGGGCTGCCATATGTGGCAAGTACCTCTTCAACTGGGCAGTAAGAACAAAGCT 
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1 CACTCCACCATGAATCACTCCCCT6TGAGGAACTACTGTCTTCACGCAGAAA6CGTCTAG 
GTGAGGTGGTACTTAGTGAGGGGACACTCCTTGATGACAGAAGTGCGTCTTTCGCAGATC 

61 CCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCCCCCCTCCCGGGAGAGCCATA 
GGTACCGCAATCATACTCACAGCACGTCGGAGGTCCTGGGGGGGAGGGCCCTCTCGGTAT 

121 GTGGTCTGCGGAACCGGTGAGTACACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGA 
CACCAGACGCCTTGGCCACTCATGTGGCCTTAACGGTCCTGCTGGCCCAGGAAAGAACCT 

181 TCAACCCGCTCAATGCCTGGAGATTTGGGCGTGCCCCCGCAAGACTGCTAGCCGAGTAGT 
AGTTGGGCGAGTTACGGACCTCTAAACCCGCACGGGGGCGTTCTGACGATCGGCTCATCA 

211 GTTGGGTCGCGAAAGGCCTTGTGGTACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAG 
CAACCCAGCGCTTTCCGGAACACCATGACGGACTATCCCACGAACGCTCACGGGGCCCTC 

301 GTCTCGTAGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAAAAAAAACAAACGTAA 
CAGAGCATCTGGCACGTGGTACTCGTGCTTAGGATTTGGAGTTTTTTTTTTGTTTGCATT 

361 CACCAACCGTCGCCCACAGGACGTCAAGTTCCCGGGTGGCGGTCAGATCGTTGGTGGAGT 
GTGGTTGGCAGCGGGTGTCCTGCAGTTCAAGGGCCCACCGCCAGTCTAGCAACCACCTCA 

421 TTACTT6TTGCCGCGCAGGGGCCCTAGATTGGGTGTGCGCGCGACGAGAAAGACTTCCGA 
AATGAACAACGGCGCGTCCCCGGGATCTAACCCACACGCGCGCTGCTCTTTCTGAAGGCT 

481 GCGGTCGCAACCTCGAGGTAGACGTCAGCCTATCCCCAAGGCTCGTCGGCCCGAGGGCAG 
CGCCAGCGTTGGAGCTCCATCTGCAGTCGGATAGGGGTTCCGAGCAGCCG6GCTCCCGTC 

541 GACCT6GGCTCAGCCCGGGTACCCTTGGCCCCTCTATGGCAATGAGGGCTGC6GGTGGGC 
CTGGACCCGAGTCGGGCCCATGGGAACCGGGGAGATACCGTTACTCCCGACGCCCACCCG 

601 GGGATGGCTCCTGTCTCCCCGTGGCTCTCGGCCTAGCTGGGGCCCCACAGACCCCCGGCG 
CCCTACCGAGGACAGAGGGGCACCGAGAGCCGGATCGACCCCGGGGTGTCTGGGGGCCGC 

661 TAGGTCGCGCAATTTGGGTAAGGTCATCGATACCCTTACGTGCGGCTTCGCCGACCTCAT 
ATCCAGCGCGTTAAACCCATTCCAGTAGCTATGGGAATGCACGCCGAAGCGGCTGGAGTA 

721 GGGGTACATACCGCTCGTCGGCGCCCCTCTTGGAGGCGCTGCCAGGGCCCTGGCGCATGG 
CCCCATGTATGGCGAGCAGCCGCGGGGAGAACCTCCGCGACGGTCCCGGGACCGCGTACC 

781 CGTCCGGGTTCTGGAAGACGGCGTGAACTATGCAACAGGGAACCTTCCTGGTTGCTCTTT 
GCAGGCCCAAGACCTTCTGCCGCACTTGATACGTTGTCCCTTGGAAGGACCAACGAGAAA 

841 CTCTATCTTCCTTCTGGCCCTGCTCTCTTGCTTGACTGTGCCCGCTTCGGCCTACCAAGT 
GAGATAGAAGGAAGACCGGGACGAGAGAACGAACTGACACGGGCGAAGCCGGATGGTTCA 

901 GCGCAACTCCACGGGGCTTTACCACGTCACCAATGATTGCCCTAACTCGAGTATTGTGTA 
CGCGTTGAGGTGCCCCGAAATGGTGCAGTGGTTACTAACGGGATTGAGCTCATAACACAT 

961 CGAGGCGGCCGATGCCATCCTGCACACTCCGGGGTGCGTCCCTTGCGTTCGTGAGGGCAA 
GCTCCGCCGGCTACGGTAGGACGTGTGAGGCCCCACGCAGGGAACGCAAGCACTCCCGTT 

1021 CGCCTCGAGGTGTTGGGTGGCGATGACCCCTACGGTGGCCACCAGGGATGGCAAACTCCC 
GCGGAGCTCCACAACCCACCGCTACTGGGGATGCCACCGGTGGTCCCTACCGTTTGAGGG 

1081 CGCGACGCAGCTTCGACGTCACATCGATCTGCTTGTCGGGAGCGCCACCCTCTGTTCGGC 
GCGCTGCGTCGAAGCTGCAGTGTAGCTAGACGAACAGCCCTCGCGGTGGGAGACAAGCCG 

1111 CCTCTACGTGGGGGACCTATGCGGGTCTGTCTTTCTTGTCGGCCAACTGTTCACCTTCTC 
GGAGATGCACCCCCTGGATACGCCCAGACAGAAAGAACAGCCGGTTGACAAGTGGAAGAG 

1201 TCCCAGGCGCCACTGGACGACGCAAGGTTGCAATTGCTCTATCTATCCCGGCCATATAAC 
AGGGTCCGCGGTGACCTGCTGCGTTCCAACGTTAACGAGATAGATAGGGCCGGTATATTG 

1261 GGGTCACCGCATGGCATGGGATATGATGATGAACTGGTCCCCTACGACGGCGTTGGTAAT 
CCCAGTGGCGTACCGTACCCTATACTACTACTTGACCAGGGGATGCTGCCGCAACCATTA 




FIG. 62E 

1321 GGCTCAGCTGCTCCG6ATCCCACAA6CCATCTT6GACAT6ATC6CTGGTGCTCACT6GGG 
CCGAGTCGACGAGGCCTAGGGTGTTCGGTAGAACCTGTACTAGCGACCACGAGTGACCCC 

1381 AGTCCTGGCGGGCATAGCGTATTTCTCCATGGTGGGGAACTGGGCGAAGGTCCTGGTAGT 
TCAGGACCGCCCGTATCGCATAAAGAGGTACCACCCCTTGACCCGCTTCCAGGACCATCA 

mm GCTGCTGCTATTTGCCGGCGTCGACGCGGAAACCCACGTCACCGGGGGAAGTGCCGGCCA 
CGACGACGATAAACGGCCGCAGCTGCGCCTTTGGGTGCAGTGGCCCCCTTCACGGCCGGT 

1501 CACTGTGTCTGGATTTGTTAGCCTCCTCGCACCAGGCGCCAAGCAGAACGTCCAGCTGAT 
GTGACACAGACCTAAACAATCGGAGGAGCGTGGTCCGCGGTTCGTCTTGCAGGTCGACTA 

1561 CAACACCAACGGCAGTTGGCACCTCAATAGCACGGCCCTGAACTGCAATGATAGCCTCAA 
GTTGTGGTTGCCGTCAACCGTGGAGTTATCGTGCCGGGACTTGACGTTACTATCGGAGTT 

1621 CACCGGCTGGTTGGCAGGGCTTTTCTATCACCACAAGTTCAACTCTTCAGGCTGTCCTGA 
GTGGGCGACCAACCGTCCCGAAAAGATAGTGGTGTTCAAGTTGAGAAGTCCGACAGGACT 

1681 GAGGCTAGCCAGCTGCCGACCCCTTACCGATTTTGACCAGGGCTGGGGCCCTATCAGTTA 
CTCCGATCGGTCGACGGCTGGGGAATGGCTAAAACTGGTCCCGACCCCGGGATAGTCAAT 

1741 TGCCAACGGAAGCGGCCCCGACCAGCGCCCCTACTGCTGGCACTACCCCCCAAAACCTTG 
ACGGTTGCCTTCGCCGGGGCTGGTCGCGGGGATGACGACCGTGATGGGGGGTTTTGGAAC 

1801 CGGTATTGTGCCCGCGAAGAGTGTGTGTGGTCCGGTATATTGCTTCACTCCCAGCCCCGT 
GCCATAACACGGGCGCTTCTCACACACACCAGGCCATATAACGAAGTGAGGGTCGGGGCA 

1861 GGTGGTGGGAACGACCGACAGGTCGGGCGCGCCCACCTACAGCTGGGGTGAAAATGATAC 
CCACCACCCTTGCTGGCTGTCCAGCCCGCGCGGGTGGATGTCGACCCCACTTTTACTATG 

1921 GGACGTCTTCGTCCTTAACAATACCAGGCCACCGCTGGGCAATTGGTTCGGTTGTACCTG 
CCTGCAGAAGCAGGAATTGTTATGGTCCGGTGGCGACCCGTTAACCAAGCCAACATGGAC 

1981 GATGAACTCAACTGGATTCACCAAAGTGTGCGGAGCGCCTCCTTGTGTCATCGGAGGGGC 
CTACTTGAGTTGACCTAAGTGGTTTCACACGCCTCGCGGAGGAACACAGTAGCCTCCCCG 

2041 GGGCAACAACACCCTGCACTGCCCCACTGATTGCTTCCGCAAGCATCCGGACGCCACATA 
CCCGTTGTTGTGGGACGTGACGGGGTGACTAACGAAGGCGTTCGTAGGCCTGCGGTGTAT 

2101 CTCTCGGTGCGGCTCCGGTCCCTGGATCACACCCAGGTGCCTGGTCGACTACCCGTATAG 
GAGAGCCACGCCGAGGCCAGGGACCTAGTGTG6GTCCACGGACCAGCTGATGGGCATATC 

2161 GCTTTGGCATTATCCTTGTACCATCAACTACACCATATTTAAAATCAGGATGTACGTGGG 
CGAAACCGTAATAGGAACATGGTAGTTGATGTGGTATAAATTTTAGTCCTACATGCACCC 

2221 AGGGGTCGAACACAGGCTGGAAGCTGCCTGCAACTGGACGCGGGGCGAACGTTGCGATCT 
TCCCCAGCTTGTGTCCGACCTTCGACGGACGTTGACCTGCGCCCCGCTTGCAACGCTAGA 

2281 GGAAGACAGGGACAGGTCCGAGCTCAGCCCGTTACTGCTGACCACTACACAGTGGCAGGT 
CCTTCTGTCCCTGTCCAGGCTCGAGTCGGGCAATGACGACTGGTGATGTGTCACCGTCCA 

2341 CCTCCCGTGTTCCTTCACAACCCTACCAGCCTTGTCCACCGGCCTCATCCACCTCCACCA 
GGAGGGCACAAGGAAGTGTTGGGATGGTCGGAACAGGTGGCCGGAGTAGGTGGAGGTGGT 

2401 GAACATTGTGGACGTGCAGTACTTGTACGGGGTGGGGTCAAGCATCGCGTCCTGGGCCAT 
CTTGTAACACCTGCACGTCATGAACATGCCCCACCCCAGTTCGTAGCGCAGGACCCGGTA 

2461 TAAGTGGGAGTAC6TCGTTCTCCTGTTCCTTCTGCTTGCAGACGCGCGCGTCTGCTCCTG 
ATTCACCCTCATGCAGCAAGAGGACAAGGAAGACGAACGTCTGCGCGCGCAGACGAGGAC 

2521 CTTGTGGATGATGCTACTCATATCCCAAGCGGAGGCGGCTTTGGAGAACCTCGTAATACT 
GAACACCTACTACGATGAGTATAGGGTTCGCCTCCGCCGAAACCTCTTGGAGCATTATGA 

2581 TAATGCAGCATCCCTGGCCGGGACGCACGGTCTTGTATCCTTCCTCGTGTTCTTCTGCTT 
ATTACGTCGTAGGGACCGGCCCTGCGTGCCAGAACATAGGAAGGAGCACAAGAAGACGAA 
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FIG. 62F 

2611 TGCATGGTATTT6AAGGGTAAGTG6GT6CCC6GAGCGGTCTACACCTTCTACGG6ATGTG 
ACGTACCATAAACTTCCCATTCACCCACGGGCCTCGCCAGATGTGGAAGATGCCCTACAC 

2701 GCCTCTCCTCCTGCTCCTGTTGGCGTTGCCCCAGCGGGCGTACGCGCTGGACACGGAGGT 
CGGAGAGGAGGACGAGGACAACCGCAACGGGGTCGCCCGCATGCGCGACCTGTGCCTCCA 

2761 GGCCGCGTCGTGTGGCGGTGTTGTTCTCGTCGGGTTGATGGCG^ 

CCGGCGCAGGACACCGCCACAACAAGAGCAGCCCAACTACCGCGACTGAGACAGTGGTAT 

2821 TTACAAGCGCTATATCAGCTGGTGCTTGTGGTGGCTTCAGTATTTTCTGACCAGAGTGGA 
AATGTTCGCGATATAGTCGACCACGAACACCACCGAAGTCATAAAAGACTGGTCTCACCT 

2881 AGCGCAACTGCACGTGTGGATTCCCCCCCTCAACGTCCGAGGGGGGCGCGACGCCGTCAT 
TCGCGTTGACGTGCACACCTAAGGGGGGGAGTTGCAGGCTCCCCCC6CGCTGCGGCAGTA 

2941 CTTACTCATGTGTGCTGTACACCCGACTCTGGTATTTGACATCACCAAATTGCTGCTGGC 
GAATGAGTACACACGACATGTGGGCTGAGACCATAAACTGTAGTGGTTTAACGACGACCG 

3001 CGTCTTCGGACCCCTTTGGATTCTTCAAGCCAGTTTGCTTAAAGTACCCTACTTTGTGCG 
GCAGAAGCCTGGGGAAACCTAAGAAGTTCGGTCAAACGAATTTCATGGGATGAAACACGC 

3061 CGTCCAAGGCCTTCTCCGGTTCTGCGCGTTAGCGCGGAAGATGATCGGAGGCCATTACGT 
GCAGGTTCCGGAAGAGGCCAAGACGCGCAATCGCGCCTTCTACTAGCCTCCGGTAATGCA 

3121 GCAAATGGTCATCATTAAGTTAGGGGCGCTTACTGGCACCTATGTTTATAACCATCTCAC 
CGTTTACCAGTAGTAATTCAATCCCCGCGAATGACCGTGGATACAAATATTGGTAGAGTG 

3181 TCCTCTTCGGGACTGGGCGCACAACGGCTTGCGAGATCTGGCCGTGGCTGTAGAGCCAGT 
AGGAGAAGCCCTGACCC6CGTGTTGCCGAACGCTCTAGACCGGCACCGACATCTCGGTCA 

3211 CGTCTTCTCCCAAATGGAGACCAAGCTCATCACGTGGGGGGCAGATACCGCCGCGTGCGG 
GCAGAAGAGGGTTTACCTCTGGTTCGAGTAGTGCACCCCCCGTCTATGGCGGCGCACGCC 

3301 TGACATCATCAACGGCTTGCCTGTTTCCGCCCGCAGGGGCCGGGAGATACTGCTCGGGCC 
ACTGTAGTAGTTGCCGAAGGGACAAAGGCGGGCGTCCCCGGCCCTCTATGACGAGCCCGG 

3361 AGCCGATGGAATGGTCTCCAAGGGGTGGAGGTTGCTGGCGGCCATCAGGGCGTACGCCCA 
TCGGCTACCTTACCAGAGGTTCCCCACCTCCAACGACCGCGGGTAGTGCCGCATGCGGGT 

3421 GCAGACAAGGGGCCTCCTAGGGTGCATAATCACCAGCCTAACTGGCCGGGACAAAAACCA 
CGTCTGTTCCCCGGAGGATCCCACGTATTAGTGGTCGGATTGACCGGCCCTGTTTTTGGT 

3481 AGTGGAGGGTGA6GTCCAGATTGTGTCAACTGCTGCCCAAACCTTC 

TCACCTCCCACTCCAGGTCTAACACAGTTGACGACGGGTTTGGAAGGACCGTTGCACGTA 

3541 CAATGGGGTGTGCTGGACTGTCTACCACGGGGCCGGAACGAGGACCATCGCGTCACCCAA 
GTTACCCCACACGACCTGACAGAT6GT6CCCCGGCCTTGCTCCTGGTA6CGCAGTGGGTT 

3601 GGGTCCTGTCATCCAGATGTATACCAATGTAGACCAAGACCTTGTGGGCTGGCCCGCTCC 
CCCAGGACAGTAGGTCTACATATGGTTACATCTGGTTCTGGAACACCCGACCGGGCGAGG 

3661 GCAAGGTAGCCGCTCATTGACACCCTGCACTTGCGGCTCCTCGGACCTTTACCTGGTCAC 
GCTTCCATCGGCGAGTAACTGTGGGACGTGAACGCCGAGGAGCCTGGAAATGGACCAGTG 

3721 GAGGCACGCCGATGTCATTCCCGTGCGCCGGCGGGGTGATAGCAG6GGCAGCCTGCTGTC 
CTCCGTGCGGCTACAGTAAGGGCACGCGGCCGCCCCACTATCGTCCCCGTCGGACGACAG 

3781 GCCCCGGCCCATTTCCTACTTGAAAGGCTCCTCGGGGGGTCCGCTGTTGTGCCCCGCGGG 
CGGGGCCGGGTAAAGGATGAACTTTCCGAGGAGCCCCCCAGGCGACAACACGGGGCGCCC 

3841 GCACGCCGTGGGCATATTTAGGGCCGCGGTGTGCACCCGTGGAGTGGCTAAGGCGGTGGA 
CGTGCGGCACCCGTATAAATCCCGGCGCCACACGTGGGCACCTCACCGATTCCGCCACCT 

3901 CTTTATCCCTGTGGAGAACCtAGAGACAACCATGAGGTCCCCGGTGTTCACGGATAACTC 
GAAATAGGGACACCTCTTGGATCTCTGTTGGTACTCCAGGGGCCACAAGTGCCTATTGAG 



FIG. 62G 



3961 CTCTCCACCAGTAGTGCCCCAGAGCTTCCA6GTGGCTCACCTCCATGCTCCCACAGGCAG 
GAGAGGTGGTCATCACGGGGTCTCGAAGGTCCACCGAGTGGAGGTACGAGGGTGTCCGTC 

4021 CGGCAAAAGCACCAAGGTCCCGGCTGCATATGCAGCTCAGGGCTATAAGGTGCTAGTACT 
GCCGTTTTCGTGGTTCCAGGGCCGACGTATACGTCGAGTCCCGATATTCCACGATCATGA 

1081 CAACCCCTCTGTTGCTGCAACACTGGGCTTTGGTGCTTACATGTCCAAGGCTCATGGGAT 
GTTGGGGAGACAACGACGTTGTGACCCGAAACCACGAATGTACAGGTTCCGAGTACCCTA 

4141 CGATCCTAACATCAGGACCGGGGTGAGAACAATTACCACTGGCAGCCCCATCACGTACTC 
GCTAGGATTGTAGTCCTGGCCCCACTCTTGTTAATGGTGACCGTCGGGGTAGTGCATGAG 

4201 CACCTACGGCAAGTTCCTTGCCGACGGCGGGTGCTCGGGGGGCGCTTATGACATAATAAT 
GTGGATGCCGTTCAAGGAACGGCTGCCGCCCACGAGCCCCCCGCGAATACTGTATTATTA 

4261 TTGTGACGAGT6CCACTCCACGGATGCCACATCCATCTTGGGCATCGGCACTGTCCTTGA 
AACACTGCTCACGGTGAGGTGCCTACGGTGTAGGTAGAACCCGTAGCCGTGACAGGAACT 

4321 CCAAGCAGAGACTGCGGGGGCGAGACTGGTTGTGCTCGCCACCGCCACCCCTCCGGGCTC 
GGTTCGTCTCTGACGCCCCCGCTCTGACCAACACGAGCGGTGGCGGTGGGGAGGCCCGAG 

4381 CGTCACTGTGCCCCATCCCAACATCGAGGAGGTTGCTCTGTCCACCACCGGAGAGATCCC 
GCAGTGACACGGGGTAGGGTTGTAGCTCCTCCAACGAGACAGGTGGTGGCCTCTCTAGGG 

4441 TTTTTACGGCAAGGCTATCCCCCTCGAAGTAATCAAGGGGGGGAGACATCTCATCTTCTG 
AAAAATGCCGTTCCGATAGGGGGAGCTTCATTAGTTCCCCCCCTCTGTAGAGTAGAAGAC 

4501 TCATTCAAAGAAGAAGTGCGACGAACTCGCCGCAAAGCTGGTCGCATTGGGCATCAATGC 
AGTAAGTTTCTTCTTCACGCTGCTTGAGCGGCGTTTCGACCAGCGTAACCCGTAGTTACG 

4561 CGTGGCCTACTACCGCGGTCTTGACGTGTCCGTCATCCCGACCAGCGGCGATGTTGTCGT 
GCACCGGATGATGGCGCCAGAACTGCACAGGCAGTAGGGCTGGTCGCCGCTACAACAGCA 

4621 CGTGGCAACCGATGCCCTCATGACCGGCTATACCGGCGACTTCGACTCGGTGATAGACTG 
GCACCGTTGGCTACGGGAGTACTGGCCGATATGGCCGCTGAAGCTGAGCCACTATCTGAC 

4681 CAATACGTGTGTCACCCAGACAGTCGATTTCAGCCTTGACCCTACCTTCACCATTGAGAC 
GTTATGCACACAGTGGGTCTGTCAGCTAAAGTCGGAACTGGGATGGAAGTGGTAACTCTG 

4741 AATCACGCTCCCCCAGGATGCTGTCTCCCGCACTCAACGTCGGGGCAGGACTGGCAGGGG 
TTAGTGCGAGGG6GTCCTACGACAGAGGGCGTGAGTTGCAGCCCCGTCCTGACCGTCCCC 

4801 GAAGCCAGGCATCTACAGATTTGTGGCACCGGGGGAGCGCCCCTCCGGCATGTTCGACTC 
CTTCGGTCCGTAGATGTCTAAACACCGTGGCCCCCTCGCGGGGAGGCCGTACAAGCTGAG 

4861 GTCCGTCCTCTGTGAGTGCTATGACGCAGGCTGTGCTTGGTATGAGCTCACGCCCGCCGA 
CAGGCAGGAGACACTCACGATACTGCGTCCGACACGAACCATACTCGAGTGCGGGCGGCT 

4921 GACTACAGTTAGGCTACGAGCGTACATGAACACCCCGGGGCTTCCCGTGTGCCAGGACCA 
CTGAT6TCAATCCGATGCTCGCATGTACTTGTGGGGCCCCGAAGGGCACACGGTCCTGGT 

4981 TCTTGAATTTTGGGAGGGCGTCTTTACAGGCCTCACTCATATAGATGCCCACTTTCTATC 
AGAACTTAAAACCCTCCCGCAGAAATGTCCGGAGTGAGTATATCTACGGGTGAAAGATAG 

5041 CCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACCAAGCCACCGTGTG 
GGTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGCATGGTTCGGTGGCACAC 

5101 CGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGTGTTTGATTCGCCT 
GC6ATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCACAAACTAAGCGGA 

5161 CAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTGGGCGCTGTTCAGAATGA 
GTTCGGGTGG6AGGTACCCGGTTGTGGGGACGATATGTCTGACCCGCGACAAGTCTTACT 

5221 AATCACCCTGACGCACCCAGTCACCAAATACATCATGACATGCATGTCGGCCGACCTGGA 
TTAGTGGGACTGCGTGGGTCA6TGGTTTATGTAGTACTGTACGTACAGCCGGCTGGACCT 




FIG. 62H 



5281 GGTC6TCACGAGCACCT6G6T6CTCGTTGGC6GCGTCCTGGCTGCTTTGGCCGCGTATTG 
CCAGCAGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGACGAAACCGGCGCATAAC 

5341 CCTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTGTCCGGGAAGCCGGCAAT 
GGACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAACAGGCCCTTCGGCCGTTA 

5401 CATACCTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGATGGAAGAGTGCTCTCAGCA 
GTATGGACTGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTACCTTCTCACGAGAGTCGT 

5461 CTTACCGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAAGCAGAAGGCCCTCGG 
GAATGGCATGTAGCTCGTJCCCTACTACGAGCGGCTCGTCAAGTTCGTCTTCCGGGAGCC 

5521 CCTCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTGTCCAGACCAACTG 
G6AGGACGTCTGGCGCAG6GCAGTCCGTCTCCAATAGCGG6GACGACA6GTCTGGTTGAC 

5581 GCAAAAACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCAGTGGGATACAATA 
CGTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGTCACCCTATGTTAT 

5641 CTTGGCGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGCTTCATTGATGGCTTTTAC 
GAACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTAACTACCGAAAATG 

5701 AGCTGCTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCTTCAACATATTGGGGGG 
TCGACGACAGTGGTCGGGTGATTGGTGATCGGTTTGGGAGGAGAAGTTGTATAACCCCCC 

5761 GTGG6TGGCTGCCCAGCTCGCC6CCCCCGGTGCCGCTACTGCCTTTGTGGGCGCTGGCTT 
CACCCACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAACACCCGCGACCGAA 

5821 AGCTGGCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAGACATCCTTGCAGG 
TCGACCGCGGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATCTGTAGGAACGTCC 

5881 GTATGGCGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAGATCATGAGCGGTGAGGTCCC 
CATACCGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACTCGCCACTCCAGGG 

5941 CTCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCGGAGCCCTCGTAGT 
GAGGTGCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGCCTCGGGAGCATCA 

6001 CGGCGTGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCGAGGGGGCAGTGCA 
GCCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACGGGGCCCGCTCCCCCGTCACGT 

6061 GTGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCCCACGGACTA 
CACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGGGTGCGTGAT 

6121 CGTGCCGGAGAGCGATGCAGCTGCCCGCGTCACTGCCATACTCAGCAGCCTCACTGTAAC 
GCACGGCCTCTCGCTACGTCGACGGGCGCAGTGACGGTATGAGTCGTCGGAGTGACATTG 

6181 CCAGCTCCTGAGGCGACTGCACCAGTGGATAAGCTCGGAGTGTACCACTCCATGCTCCGG 
GGTCGAGGACTCCGCTGACGTGGTCACCTATTCGAGCCTCACATGGTGAGGTACGAGGCC 

6241 TTCCTGGCTAAGGGACATCTGGGACTGGATATGCGAGGTGTTGAGCGACTTTAAGACCTG 
AAGGACCGATTCCGTGTAGACCCTGACCTATACGCTCCACAACTCGCTGAAATTCTGGAC 

6301 GCTAAAAGCTAAGCTCATGCCACAGCTGCCTGGGATCCCCTTTGTGTCCTGCCAGCGCGG 
CGATTTTCGATTCGAGTACGGTGTCGACGGACCCTAGGGGAAACACAGGACGGTCGCGCC 

6361 GTATAAGGGGGTCTGGCGAGTGGACGGCATCATGCACACTCGCTGCCACTGTGGAGCTGA 
CATATTCCCCCAGACCGCTCACCTGCCGTAGTACGTGTGAGCGACGGTGACACCTCGACT 

6421 GATCACT6GACATGTCAAAAACGG6ACGATGAGGATCGTCGGTCCTAGGACCTGCAGGAA 
CTAGTGACCTGTACAGTTTTTGCCCTGCTACTCCTAGCAGCCAGGATCCTGGACGTCCTT 

6481 CATGTGGAGTGGGACCTTCCCCATTAATGCCTACACCACGGGCCCCTGTACCCCCCTTCC 
GTACACCTCACCCT6GAAGGGGTAATTACGGATGTGGTGCCCGGGGAGATGGGGGGAAGG 

6541 TGCGCCGAACTACACGTTCGCGCTATGGAGGGTGTCTGCAGAGGAATATGTGGAGATAAG 
ACGCGGCTTGATGTGCAAGCGCGATACCTCCCACAGACGTCTCCTTATACACCTCTATTC 




FIG. 621 

6601 GCAGGTGGGGGACTTCCACTAC6TGACGGGTATGACTACTGACAATCTCAAATGCCC6TG 
CGTCCACCCCCTGAAGGTGATGCACTGCCCATACTGATGACTGTTAGAGTTTACGGGCAC 

6661 CCAG6TCCCATCGCCCGAATTTTTCACAGAATTGGACGGGGTGCGCCTACATAGGTTTGC 
GGTCCAGGGTAGCGGGCTTAAAAAGTGTCTTAACCTGCCCCACGCGGATGTATCCAAACG 

6721 GCCCCCCTGCAAGCCCTTGCTGCGGGAGGAGGTATCATTCAGAGTAGGACTCCACGAATA 
CGGGGGGACGTTCGGGAACGACGCCCTCCTCCATAGTAAGTCTCATCCTGAGGTGCTTAT 

6781 CCCGGTAGGGTCGCAATTACCTTGCGAGCCCGAACCGGACGTGGCCGTGTTGACGTCCAT 
GGGCCATCCCAGCGTTAATGGAACGCTCGG6CTTGGCCTGCACCGGCACAACTGCAGGTA 

6841 GCTCACTGATCCCTCCCATATAACAGCAGAGGCGGCCGGGCGAAGGTTGGCGAGGGGATC 
CGAGTGACTAGGGAGGGTATATTGTCGTCTCCGCCGGCCCGCTTCCAACCGCTCCCCTAG 

6901 ACCCCCCTCTGTGGCCAGCTCCTCGGCTAGCCAGCTATCCGCTCCATCTCTCAAGGCAAC 
TGGGGGGAGACACCGGTCGAGGAGCCGATCGGTCGATAGGCGAGGTAGAGAGTTCCGTTG 

6961 TTGCACCGCTAACCATGACTCCCCTGATGCTGAGCTCATAGAGGCCAACCTCCTATGGAG 
AACGTGGCGATTGGTACTGAGGGGACTACGACTCGAGTATCTCCGGTTGGAGGATACCTC 

7021 GCAGGAGATGGGCGGCAACATCACCAGGGTTGAGTCAGAAAACAAAGTGGTGATTCTGGA 
CGTCCTCTACCCGCCGTTGTAGTGGTCCCAACTCAGTCTTTTGTTTCACCACTAAGACCT 

7081 CTCCTTCGATCCGCTTGTGGCGGAGGAGGACGAGCGGGAGATCTCCGTACCCGCAGAAAT 
GAGGAAGCTAGGCGAACACCGCCTCCTCCTGCTCGCCCTCTAGAGGCATGGGCGTCTTTA 

7141 CCTGCGGAAGTCTCGGAGATTCGCCCAGGCCCTGCCCGTTTGGGCGCGGCCGGACTATAA 
GGACGCCTTCAGAGCCTCTAAGCGGGTCCGGGACGGGCAAACCCGCGCCGGCCTGATATT 

7201 CCCCCCGCTAGTGGAGACGTGGAAAAAGCCCGACTACGAACCACCTGTGGTCCATGGCTG 
GGGGGGCGATCACCTCTGCACCTTTTTCGGGCTGATGCTTGGTGGACACCAGGTACCGAC 

7261 TCCGCTTCCACCTCCAAAGTCCCCTCCTGTGCCTCCGCCTCGGAAGAAGCGGACGGTGGT 
AGGC6AAGGTGGAGGTTTCAGGGGAGGACACGGAGGCGGAGCCTTCTTCGCCTGCCACCA 

7321 CCTCACTGAATCAACCCTATCTACTGCCTTGGCCGAGCTCGCCACCAGAAGCTTTGGCAG 
GGAGTGACTTAGTTGGGATAGATGACGGAACCGGCTCGAGCGGTGGTCTTCGAAACCGTC 

7381 CTCCTCAACTTCCGGCATTACGGGCGACAATACGACAACATCCTCTGAGCCCGCCCCTTC 
GAGGAGTTGAAGGCCGTAATGCCCGCTGTTATGCTGTT6TAGGAGACTCGGGCGGGGAAG 

7441 TGGCTGCCCCCCCGACTCCGACGCTGAGTCCTATTCCTCCATGCCCCCCCTGGAGGGGGA 
ACCGACGGGGGGGCTGAGGCTGCGACTCAGGATAAGGAGGTACGGGGGGGACCTCCCCCT 

7501 GCCTGGGGATCCGGATCTTAGCGACGGGTCATGGTCAACGGTCAGTAGTGAGGCCAAGGC 
CGGACCCCTAGGCCTAGAATCGCTGCCCAGTACCAGTTGCCAGTCATCACTCCGGTTGCG 

7561 GGAGGATGTCGTGTGCTGCTCAATGTCTTACTCTTGGACAGGCGCACTCGTCACCCCGT6 
CCTCCTACAGCACACGACGAGTTACAGAATGAGAACCTGTCCGCGTGAGCAGTGGGGCAC 

7621 CGCCGCGGAAGAACAGAAACTGCCCATCAATGCACTAAGCAACTCGTTGCTACGTCACCA 
GCGGCGCCTTCTTGTCTTTGACGGGTAGTTACGTGATTCGTTGAGCAACGATGCAGTGGT 

7681 CAATTTGGTGTATTCCACCACCTCACGCAGTGCTTGCCAAAGGCAGAAGAAAGTCACATT 
GTTAAACCACATAAGGTGGTGGAGTGCGTCACGAACGGTTTCCGTCTTCTTTCAGTGTAA 

7741 TGACAGACTGCAA6TTCTGGACAGCCATTACCAGGACGTACTCAAGGAGGTTAAAGCAGC 
ACTGTCTGACGTTCAAGACCTGTCGGTAATGGTCCTGCATGAGTTCCTCCAATTTCGTCG 

7801 GGCGTCAAAAGTGAAGGCTAACTTGCTATCCGTAGAGGAAGCTTGCAGCCTGACGCCCCC 
CCGCAGTTTTCACTTCCGATJGAACGATAGGCATCTCCTTCGAACGTCGGACTGCGGGGG 

7861 ACACTCAGCCAAATCCAAGTTTGGTTATGGGGCAAAAGACGTCCGTTGCCATGCCAGAAA 
TGTGAGTCGGTTTAGGTTCAAACCAATACCCCGTTTTCTGCAGGCAACGGTACGGTCTTT 
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7921 GGCCGTAACCCACATCAACTCCGTGTGGAAAGACCTTCTGGAAGACAATGTAACACCAAT 
CCGGCATTGGGTGTAGTTGAGGCACACCTTTCTGGAAGACCTTCTGTTACATTGTGGTTA 

7981 AGACACTACCATCATGGCTAAGAACGAGGTTTTCTGCGTTCAGCCTGAGAAGGGGGGTCG 
TCTGTGATGGTAGTACCGATTCTTGCTCCAAAAGACGCAAGTCGGACTCTTCCCCCCAGC 

8041 TAAGCCAGCTCGTCTCATCGTGTTCCCCGATCTGGGCGTGCGCGTGTGCGAAAAGATGGC 
ATTCGGTCGAGCAGAGTAGCACAAGGGGCTAGACCCGCACGCGCACACGCTTTTCTACCG 

8101 TTTGTACGACGTGGTTACAAAGCTCCCCTTGGCCGTGATGGGAAGCTCCTACGGATTCCA 
AAACATGCTGCACCAATGTTTCGAGGGGAACCGGCACTACCCTTCGAGGATGCCTAAGGT 

8161 ATACTCACCAGGACAGCGGGTTGAATTCCTCGTGCAAGCGTGGAAGTCCAAGAAAACCCC 
TATGAGTGGTCCTGTCGCCCAACTTAAGGAGCACGTTCGCACCTTCAGGTTCTTTTGGGG 

8221 AATGGGGTTCTCGTATGATACCCGCTGCTTTGACTCCACAGTCACTGAGAGCGACATCCG 
TTACCCCAAGAGCATACTATGGGCGACGAAACTGAGGTGTCAGTGACTCTCGCTGTAGGC 

8281 TACCGAGGAGGCAATCTACCAATGTTGTGACCTCGACCCCCAAGCCCGCGTGGCCATCAA 
ATGCCTCCTCCGTTAGATGGTTACAACACTGGAGCTGGGGGTTCGGGCGCACCGGTAGTT 

8341 GTCCCTCACCGAGAGGCTTTATGTTGGGGGCCCTCTTACCAATTCAAGGGGGGAGAACTG 
CAGGGAGTGGCTCTCCGAAATACAACCCCCGGGAGAATGGTTAAGTTCCCCCCTCTTGAC 

8401 CGGCTATCGCAGGTGCCGCGCGAGCGGCGTACTGACAACTAGCTGTGGTAACACCCTCAC 
GCCGATAGCGTCCACGGCGCGCTCGCCGCATGACTGTTGATCGACACCATTGTGGGAGTG 

8461 TTGCTACATCAAGGCCCGGGCAGCCTGTCGAGCCGCAGGGCTCCAGGACTGCACCATGCT 
AACGATGTAGTTCCGGGCCCGTCGGACAGCTCGGCGTCCCGAGGTCCTGACGTGGTACGA 

8521 CGTGTGTGGCGACGACTTAGTCGTTATCTGTGAAAGCGCGGGGGTCCAGGAGGACGCGGC 
GCACACACCGCTGCTGAATCAGCAATAGACACTTTCGCGCCCCCAGGTCCTCCTGCGCCG 

8581 GAGCCTGAGAGCCTTCACGGAGGCTATGACCAGGTACTCCGCCCCCCCTGGGGACCCCCC 
CTCGGACTCTCGGAAGTGCCTCCGATACTGGTCCATGAGGCGGGGGGGACCCCTGGGGGG. 

8641 ACAACCAGAATACGACTTGGAGCTCATAACATCATGCTCCTCCAACGTGTCAGTCGCCCA 
TGTTGGTCTTATGCTGAACCTCGAGTATTGTAGTACGAGGAGGTTGCACAGTCAGCGGGT 

8701 CGACGGCGCTGGAAAGAGGGTCTACTACCTCACCCGTGACCCTACAACCCCCCTCGCGAG 
GCTGCCGCGACCTTTCTCCCAGATGATGGAGTGGGCACTGGGATGTTGGGGGGAGCGCTC 

8761 AGCTGCGTGGGAGACAGCAAGACACACTCCAGTCAATTCCTGGCTAGGCAACATAATCAT 
TCGACGCACCCTCTGTCGTTCTGTGTGAGGTCAGTTAAGGACCGATCCGTTGTATTAGTA 

8821 GTTTGCCCCCACACTGTGGGCGAGGATGATACTGATGACCCATTTCTTTAGCGTCCTTAT 
CAAACGGGGGTGTGACACCCGCTCCTACTATGACTACTGGGTAAAGAAATCGCAGGAATA 

8881 AGCCAGGGACCAGCTTGAACAGGCCCTCGATTGCGAGATCTACGGGGCCTGCTACTCCAT 
TCGGTCCCTGGTGGAACTTGTCCGGGAGCTAACGCTCTAGATGCCCCGGACGATGAGGTA 

8941 AGAACCACTTGATCTACCTCCAATCATTCAAAGACTCCATGGCCTCAGCGCATTTTCACT 
TCTTG6TGAACTAGATGGAGGTTAGTAAGTTTCTGAGGTACCGGAGTCGCGTAAAAGTGA 

9001 CCACAGTTACTCTCCAGGTGAAATTAATAGGGTGGCCGCATGCCTCAGAAAACTTGGGGT 
GGTGTCAATGAGAGGTCCACTTTAATTATCCCACCGGCGTACGGAGTCTTTTGAACCCCA 

9061 ACCGCCCTTGCGAGCTTGGAGACACCG6GCCCGGAGCGTCCGCGCTAGGCTTCTGGCCAG 
TGGCGGGAACGCTCGAACCTCTGTGGCCCGGGCCTCGCAGGCGCGATCCGAAGACCGGTC 

9121 AGGAGGCAGGGCTGCCATATGTGGCAAGTACCTCTTCAACTGGGCAGTAAGAACAAAGCT 
TCCTCCGTCCCGACGGTATACACCGTTCATGGAGAAGTTGACCCGTCATTCTTGTTTCGA 

9181 CAAAC 
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FIG. 79A 

AlaValAspPhelleProValGluAsnLeuGluThrThrMetArgSerProValPh Thr 
2 GCGGTGGACTTTATCCCTGTGGAGAACCTAGAGACAACCAT6AG6TCCCCGGT6TTCACG 
CGCCACCTGAAATAGGGACACCTCTTGGATCTCTGTTGGTACTCCAGGGGCCACAAGTGC 

^ ^ ^ ^ aa 

29 MAE1, 40 NLA111, 43 MNLl, 45 AVA2 NLA1V SAU96, 49 NCI1 SC 
RF1, 50 HPA11, 

AspAsnSerSerProProValValProGlnSerPheGlnValAlaHisLeuHisAlaPro 
6 2 GATAACTCCTCTCCACCAGTAGTGCCCCAGAGCTTCCAGGTGGCTCACCTCCATGCTCCC 
CTATTGAGGAGAGGTGGTCATCACGGGGTCTCGAAGGTCCACCGAGTGGAGGTACGAGGG 

69 MNLl, 83 BSP1286, 92 ALU1, 97 EC0R11 SCRF1, 106 HPH, 109 
MNLl, 113 NLA111, 

ThrGlySetGlyLysSerThrLysValProAlaAlaTyrAlaAlaGlnGlyTyrLysVal 
122 ACAGGCAGCGGCAAAAGCACCAAGGTCCCGGCTGCATATGCAGCTCAGGGCTATAAGGTG 
TGTCCGTCGCCGTTTTCGTGGTTCCAGGGCCGACGTATACGTCGAGTCCCGATATTCCAC 

^ ^ AA A A AAA 

126 BBV FNU4H1, 127 NSPB11, 129 FNU4H1, 145 AVA2 NLA1V SAU96 
, 148 NCI1 SCRF1, 149 HPA11, 152 BBV FNU4H1, 156 NDE1, 161 B 
BV FNU4H1, 163 ALU1, 165 DDE1, 

LeuValLeuAsnProSerValAlaAlaThrLeuGlyPheGlyAlaTyrMetSerLysAla 
182 CTAGTACTCAACCCCTCTGTTGCTGCAACACTGGGCTTTGGTGCTTACATGTCCAAGGCT 
G ATCATG AG TTGGGG AG ACAACG ACG TTGTG ACCCG AAACCACG AATG TACAGG TTCCG A 

— * a •"«• a aa 

182 MAE1, 184 SCA1, 185 RSA1, 195 MNLl, 203 BBV FNU4H1, 228 
AFL111 NSPC1, 229 NLA111, 

HlsGlylleAspProAsnlleArgThrGlyValArgThrlleThrThrGlySerProIle 
242 CATGGGATCGATCCTAACATCAGGACCGGGGTGAGAACAATTACCACTCGCAGCCCCATC 
GTACCCTAGCTAGGATTGTAGTCCTGGCCCCACTCTTCTTAATGGTGACCGTCGGGGTAG 

AAAA a AAA A 

242 NLA111, 246 BIN1, 247 MBOl SAU3A, 248 CLA1, 249 TAQ1, 25 
1 BIN1 MBOl SAU3A, 264 AVA2 SAU96, 267 HPA11 NCI1 SCRF1, 271 
HPH, 291 BBV FNU4H1, 

ThrTyrSerThrTyrGlyLysPheLeuAlaAspGlyGlyCysSerGlyGlyAlaTyrAsp 
302 ACGTACTCCACCTACGGCAAGTTCCTTGCCGACGGCGGGTGCTCGGGGGGCGCTTATGAC 
TCCATGAGGTGGATGCCGTTCAAGGAACGGCTGCCGCCCACGAGCCCCCCGCGAATACTG 

302 MAJ32, 304 RSA1, 340 BSP1286 HGIA, 343 AVA1, 350 HAE11, 3 
51 HHA1, 

IlellelleCysAspGluCysHlsSerThrAspAlaThrSerlleLeuGlylleGlyThr 
362 ATAATAATTTGTGACGAGTGCCACTCCACGGATGCCACATCCATCTTGGGCATTGGCACT 
TATTATTAAACACTGCTCACGGTGAGGTGCCTACGGTGTAGGTAGAACCCGTAACGGTGA 

AA A 

372 MAES, 391 FOKl, 392 SFAN1, 399 F0K1, 

ValLeuAspGlnAl aGluThr Al aGly Al aArgLeuVa 1 Va lLeuAl aThr Al a Thr Pro 
422 GTCCTTGACCAAGCAGAGACTGCGGGGGCG AGACTGGTTGTGCTCGCCACCGCCACCCCT 
CAGGAACTGGTTCGTCTCTGACGCCCCCGCTCTGACCAACACGAGCGGTGGCGGTGGGGA 

431 TTHTII2, 435 ALWN1, 461 BSP1286 HGIA, 479 MNLl, 





FIG. 79B 

ProGlySerValThrValProHisProAsnlleGluGluValAlaLeuS rThrThrGly 
482 CCGGGCTCCGTCACTGTGCCCCATCCCAACATCGAGGAGGTTGCTCTGTCCACCACCGGA 
GGCCCGAGGCAGTGACACGGGGTAGGGTTGTM^ 

482 HPA11 NCI1 SCRF1, 484 BANll BSP1286, 485 NLA1V, 491 MAE3 
, 497 BSP1286, 503 FOK1, 513 TAQ1, 515 MNL1, 518 MNL1, 537 H 
PA11, 

GluIleProPheTyrGlyLysAlalleProLeuGluVallleLysGlyGlyArgHisLeu 
542 GAGATCCCTTTTTACGGCAAGGCTATCCCCCTCGAAGTAATCAAGGGGGGGAGACATCTC 
CTCTAGGGAAAAATGCCGTTCCGATAGGGGGAGCTTCATTAGTTCCCCCCCTCTGTAGAG 



543 XH02, 544 BIN1 MB01 SAU3A, 571 MNL1, 573 TAQ1, 

IlePheCysHisSerLysLysLysCysAspGluLeuAlciAlaLysLeuValAlaLeuGly 
602 ATCTTCTGTCATTCAAAGAAGAAGTGCGACGAACTCGCCGCAAAGCIGGTCGCATTGGGC 
TAGAAGACAGTAAGTTTCTTCTTCACGCTGCTTGAGCGGCGTTTCGACCAGCGTAACCCG. 

603 MBOU, 619 MBOll, 638 FNU4H1/ 645 ALU1, 660 SFANl, 

IleAsnAlaValAlaTyrTyrArgGlyLeuAspValSerVallleProThrSerGlyAsp 
662 ATCAATGCCGTGGCCTACTACCGCGGTCTTGACGTGTCCGTCATCCCGACCAGCGGCGAT 
TAGTTACGGCACCGGATGATGGCGCCAGAACTCCACAGGCAGTAGGGCTGGTCGCCGCTA 

■»■» ^ A A A A 

672 HAE1, 673 HAE111, 682 NSPBll SAC 2, 683 THAI, 693 AFL111 
MAE2, 703 F0K1, 712 NSPBll, 714 FNU4H1, 

ValValValValMaThrAspAlaLeuMetTh^ 

722 GTTGTCGTCGTGGCAACCGATGCCCTCATGACCGGCTATACCGGCGACTTCGACTCGGTG 

CAACAGCAGCACCGTTGGCTACGGGAGTACTGGCCGATATGGCCGCTGAAGCTGAGCCAC 

^ ^ a a a a a 

740 SFANl, 745 MNL1, 748 NLA111, 753 HPA11, 762 HPA11, 771 T 
AQ1, 773 HINF1, 778 HPH, 

IleAspCysAsnThrCysValThrGlnThrValAspPheSerLeuAspProThrPheThr 
782 ATAGACTGCAATACGTGTGTCACCCAGACAGTCGATTTCAGCCTTGACCCTACCTTCACC 
TATCTGACGTTATCCACACAGTGGGTCTGTCA^ 

794 AFL111 MAE2, 800 MAE3, 801 HPH, 813 TAQ1, 837 HPH, 

IleGluThrlleThrLeuProGlnAspAlaValSerArgThrGloArgArgGlyArgThr 
842 ATTGAGACAATCACGCTCCCCCAAGATGCTGTCTCCCGCACTCAACGTCGGGGCAGGACT 
TAACTCTGXTAGTGCGAGGGGGTTCTACGACAGAGGGCGTGAGTTGGAGCCCCGTCCTGA 

866 SFANl/ 886 MAE2, 

GlyArgGlyLysProGlylleTyrArgPheValAlaProGlyGluArgProSerGlyMet 
902 GGCAGGGGGAAGCCAGGCATCTACAGATTTGTGGCACCGGGGGAGCGCCCCTCCGGCATG 
CCGTCCCCCTTCGGTCCGTAGATGTCTAAACACCGTG^C^ 

914 EC0R11 SCRF1, 918 SFANl, 934 BAN1 NLA1V, 938 HPA11 NCI1 
SCRF1, 945 HAE11, 946 HHA1, 948 BGL1, 951 MNL1, 954 HP All, 9 
57 NSPC1, 958 NLAlll, 

PheAspSerSerValLeuCysGluCysTyrAspAlaGlyCysAlaTrpTyrGluLeuThr 
962 TTCGACTCGTCCGTCCTCTGTGAGTGCTATGACGCAGGCTGTGCTTGGTATGAGCTCACG 
AAGCT6A6CAGGCAG6At^CACTCAC6ATACTGCGTCCGACACGAACCATACTCGA6T6C 

963 TAQ1, 965 HINF1, 976 MNL1, 992 HGA1, 1003 TTHIIX2, 1013 
BANll BSP1286 HGIA SAC1, 1014 ALU1, 
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1562 



1051 RSA1, 1054 NIAlll, 1063 AVA1 NCI1 SCRF1 SMA1, 1064 HPA1 
1 NCI1 SCRF1, 1081 ECOR11 SCRF1, 

GlnAspHisLeuGluPheTrpGluGlyValPheThrGlyLeuThrHisIleAspAlaHis 
CAGGACCATCTTGAATTTTGGGAGGGCGTCTTTACAGGCCTCACTCATATAGATGCCCAC 
GTCCTGGTAGAACTTAAAACCCTCCCGCAGAAATGTCCGGAGTGAGTATATCTACGGGTG 

1084 AVA2 SAU96 , 1103 MNL1, 1106 AHA11, 1107 HGA1, 1117 HAE1 
STU1, 1118 HAE111, 1120 MNI*1, 1133 SFAN1, 

PheLeuSerGlnThrLysGlnSerGlyGluAsnLeuProTyrLeuValAlaTyrGlnAla 
TTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACCAAGCC 
AAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCG^TGGITCTO 

1183 ECOR11 SCRF1, 1192 RSA1, 1201 DRA3, 

ThrValCysAlaArgAlaGlnAlaProProProSerTrpAspGlnMetTrpLysCysLeu 
ACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGTGTTTG 
TGGCACACGCGATCCCGAGTTCGGGGAGG^ 

1209 HHA1, 1212 MAE1, 1215 BAN 11 BSP1286, 1226 MNL1, 1239 NL 
A1V, 1240 AVA2 SAU96, 1256 TTHIII2, 1261 HINF1, 

IleArgl^uLysProThrLeuHisGlyProThrProLeuLeuTyrArgLeuGlyAlaVal 
ATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTGGGCGCTGTT 
TAAGCGGAGTTCGGGTG^A^TACCCGGTTGTGGGGACGATATGTCTGACCCGCGACAA 

1267 MNL1/ 1279 MNia, 1282 NCOl, 1283 NIAlll, 1286 SAU96, 12 
87 HAE111, 1313 HAE11, 1314 HHA1, 

GlnAsnGluIleThrLeuThrHisProValThrLysTyrlleMetThrCysMetSerAla 
CAGAATGAAATCACCCTGACGCACCGAGTCACCAAATACATCATGACATGCATGTCGGCC 
GTCTTACTTTAGTGGGACTGCG TGGG TCM TGGTTTATGTAGTACTGTACGTACAGCCGG 

1332 HPH, 1339 HGA1, 1349 MAE3, 1350 HPH, 1363 NIAlll, 1367 
NSPC1, 1368 NIAlll, 1369 AVA3 NSI1 , 1371 NSPC1, 1372 NIAlll, 
1377 CFR1 XMA3, 1378 HAE111, 

AspI^uGluValValThrSerThrTrpValLeuValGlyGlyValLeuAlaAlaLeuAla 
GACCTGGAGGTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTTTGGCC 
CTGGACCTCC^CAGTGCTCGTGGACCCACGAGCAACCGCOT 

1384 EC0R11 SCRF1, 1385 GSU1, 1388 MNL1, 1394 MAE3, 1399 BSP 
1286 HGIA/ 1404 ECOR11 SCRF1, 1409 BSP1286 HGIA, 1419 FNU4H1 
, 1421 AHA11, 1422 HGA1, 1426 ECOR11 SCRF1, 1430 BBV FNU4H1, 
1437 CFR1, 1438 HAE111, 1439 FNU4H1, 1441 THAI, 

AlaTyrCysI^uScrThrGlyCysValVallleValGlyArgValValLeuSerGlyLys 
GCGTATTGCCTGTCAACAGGCTGOGTt^TCATAGTGGGCAGGGTCGTCTTGTCCGGGAAG 
CGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAACAGGCCCTTC 

1453 HINC11, 1461 BBV FNU4H1, 1494 HPAll NCI1 SCRF1, 1501 MA 
El, 

ProAlallelleProAspArgGluValLeuTyrArgGluPheAspGluMetGluGluCys 
CCGGCAATCATACCIX^CAGGGAAGTCCTCTACCGAGAGTTOGATGAGATGGAAGAGTGC 
^CCGTTAGTATGGACTGTCCCTTCAGGAGAIGGCTCTCAAGCTACTCT 

1502 HPAll, 1528 MNLl, 1542 TAQ1, 1553 MBOll, 1558 BSP1286 H 
GIA, 

SerGlnHisI^uProTyrlleGluGlnGlyMetMetl^uAlaGluGlnPheLysGlnLys 
TCTCAGCACTTACCGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAAGCAGAAG 



1563 DDE1 . 1576 RSA1, 1581 TAQ1/ 1590 FOK1, 1594 SFAN1, 1612 
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TTHIII2, 1621 HAE111 SAU96, 



FIG. 79D 



AlaLeuGlyLeuLeuGlnThrAlaSerArgGlnAlaGluVallleAlaProAlaValGIn 
1622 GCCCTCGGCCTCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTGTCCAG 
CGTOAGCCGGAG^CGTCTG^GCAGGGCAGTCCGTCTCCAATAGCGGGGACGACAGGTC 

1624 MNL1, 1628 HAE111, 1630 MNL1, 1634 PSTl, 1639 TTHIII1, 
1642 THAI/ 1643 HGA1, 1658 MNL1, 

ThrAsnTrpGlnLysLeuGluThrPheTrpAlaLysHisMetTrpAsnPhelleSerGly 

1682 ACGAACTGGCAAAAACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCAGTGGG 

TGGTTGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGTCACCC 
aa a 

1697 AVA1 XHOl, 1698 TAQ1, 1718 NDEl, 

IleGlnTyrLeuAlaGlyLeuSerThrLeuProGlyAsnProAlalleAlaSerLeuMet 
1742 ATACAATACTTGGCGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTC 

TATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTAACTAC- 

A A A aa 

1762 HINC11, 1768 BBV FNU4H1/ 1772 BCOR11 SCRF1, 1775 BSTE2/ 
1776 MAE3, 

AlaPheThrAlaAlaValThrSerProLeuThrThrSerGlnThrLeuLeuPheAsnlle 
1802 GCTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCTTCAACATA 
CGAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGGTTTCGGAGGAGAAGTTGTAT 

AAA ^ A A AAA 

1809 ALWN1 NSPB11 PVU11, 1810 ALU1, 1811 BBV FNU4H1, 1817 MA 
E3, 1818 HPH/ 1836 MAE1, 1846 MNL1, 1849 MNL1, 1851 MBOll, 

LeuGlyGlyTrpValAlaAlaGlnLeuAlaAlaProGlyAlaAlaThrAlaPheValGly 
1862 TTGGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGTGGGC 
AACCCCCCCACCCACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAACACCCG 

A A A AA A A A A 

1877 BBV FNU4H1, 1884 ALU1, 1889 FNU4H1, 1895 NCI1 SCRF1, 18 
96 HPAll, 1898 BAN1 NLA1V, 1901 FNU4H1, 1919 HAE11, 1920 HHA 
1/ 

AlaGlyLeuAlaGlyAlaAlalleGlySerValGlyLeuGlyLysValLeuXleAspIle 
1922 GCTGGCTTAGCTGGCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAGACATC 
CGACCGAATCGACCGCGGCGGTAGCCGTCACAACCTGACCCCTTCC^ 

1927 DDE1, 1930 ALUl, 1934 AHA 11 BANl HAE11 NAR1 NIA1V, 1935 
HHA1, 1937 FNU4H1, 1966 AVA2 SAU96/ 1969 MNL1, 1978 FOKl, 

LeuMaGlyTyrGlyAlaGlyValAlaGlyAlaLe^^ 
1982 CTTGCAGGGTATGGCGCGGGCGTGGCGGGAGCTCTTGT^ 
GAACGTtXTCfcXACCGCG^ 

1995 HHA1, 1996 THAI, 2010 BAN 11 BSP1286 HGIA SAC1, 2011 ALU 
1, 2021 BSM1, 2029 MBOl SAU3A, 2032 NIA111, 2039 HPH, 

GluValProSerThrGluAspI^uValAsnl^ul^uProAlallel^uSerProGly 
204 2 GAGGTCCCCTCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCGGAGCC 
CTCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGC 

AAA A A A A A A AA A 

2042 MHL1, 2044 AVA2 NIA1V SAU96, 2049 MNI#1, 2057 MNLl, 2059 
AVA2 SAU96 , 2060 TTHIII1, 2062 EC0R11 SCRF1, 2083 FOKl, 208 
6 MHL1, 2093 NCI1 SCRF1, 2094 HP All, 2096 NIA1V, 2097 BAM11 
BSP1286, 2101 MNL1, 

LeuValValGlyValValC^sAlaAlallel^iiArgArgHisValGlyProGlyGluGly 
2102 CTCGTAGTCGGCGTGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCGAGGGG 
GAGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGCTCCCC 

A A AA A A AA A 

2123 BBV FNU4H1, 2134 HHA1, 2136 NAE1, 2137 HPAll, 2142 KAB2 
, 2147 HAElll SAU96, 2149 AVA1 NCI1 SCRP1 SMA1, 21S0 HPAll N 



FIG. 79E 



CIl SCRFl, 2156 MNL1, 

AlaValGlnTrpMetAsnArgLeuIleAlaPheAlaSerArgGlyAsnHisValSer 
2162 GCAGTGCAGTGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCC 
CGTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGG 

2172 F0K1, 2179 HPA11, 2196 MNL1, 2199 AVA1 NCI1 SCRFl SMA1, 
2200 HPA11 NC11 SCRFl, 2205 NLA1V, 2210 NLA111, 
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GluPheGlySerVallleProThrSerGlyAspValValValValAlaThrAspAlaLeu 
1 GAATTCGGGTCCGTCATCCCGACCAGCGGCGATGTTGTCGTCGTGGCAACCGATGCCCTC 
CTTAAGCCCAGGCAGTAGGGCTGGTCGCCGCTACAACAGCAGCACCGTTGGCTACGGGAG 

A AA AAA AAA 

1 EC0R1, 7 NLA1V, 8 AVA2 SAU96, 15 F0K1, 24 NSPB11, 26 PNU4H 
1, 52 SFAN1, 57 MNL1, 60 NLAlll, 

MetThrGlyTyrThrGlyAspPheAspSerVallleAspCysAsnThrCysValThrGln 
6 1 ATGACCGGCTATACCGGCGACTTCGACTCGGTGATAGACTGCAATACGTGTGTCACCCAG 
TACTGGCCGATATGGCCGCTGAAGCTGAGCCACTATCTGACGTTATGCACACAGTGGGTC 

A A AAA A AA 

65 HPA11, 74 HPA11, 83 TAQ1, 85 HINF1, 90 HPH/ 106 AFL111 MA 
E2, 112 MAE3, 113 HPH, 

ThrValAspPheSerLeuAspProThrPheThrlleGluThrlleThrLeuProGlnAsp 
121 ACAGTCGATTTCAGCCTTGACCCTACCTTCACCATTGAGACAATCACGCTCCCCCAAGAT 
TGTCAGCTAAAGTCGGAACTGGGATGGAAGTGGTAACTCTGTTAGTGCGAGGGGGTTCTA 

A • A A 

125 TAQ1, 149 HPH, 178 SFAN1, 

AlaValSerArgThrGlnArgArgGlyArgThrGlyArgGlyLysProGlylleTyrArg 
181 GCTGTCTCCCGCACTCAACGTCGGGGCAGGACTGGCAGGGGGAAGCCAGGCATCTACAGA 
CGACAGAGGGCGTGAGTTGCAGCCCCGTCCTGACCGTCCCCCTTCGGTCCGTAGATGTCT 

A A A 

198 MAE2, 226 ECOR11 SCRF1, 230 SFAN1, 

PheValAlaProGlyGluArgProProAlaCysSerThrArgProSerSerValSerAla 
241 TTTGTGGCACCGGGGGAGCGCCCTCCGGCATGTTCGACTCGTCCGTCCTCTGTGAGTGCC 
AAACACCGTGGCCCCCTCGCGGGAGGCCGTACAAGCTGAGCAGGCAGGAGACACTCACGG 

A A AA A A AA A A A A 

246 BAN1 NLA1V, 250 HPA11 NCI1 SCRF1, 257 HAE11, 258 HHA1, 2 
62 MNL1, 265 HPA11, 268 NSPC1, 269 NLAlll, 274 TAQ1, 276 HIN 
Fl, 287 MNL1, 296 BSP1286, 

Arglle 
301 CGAATTC 
GCTTAAG 

302 ECOR1, 
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10 20 30 40 

GAATTCGGACGACGCAAGGTTGCAATTGCTCTATCTATCCCGGCCATAT 
X: :::::::::::::::::::::::::::::::::::::::: : 
/SSp CTCTCCCAGGCGCCACTGGACGACGCAAGGTTGCAATTGCTCTATCTATCCCGGCCATAT 
550 560 570 580 590 600 

50 60 70 80 A 90 100 

AACAGGTCACCGCATGGCATGGGATATGATGATGAACTGGTCCCCTACGACGGCGTTAGT 

AACGGGTCACCGCATGGCATGGGATATGATGATGAACTGGTCCCCTACGACGGCGTTGGT 
610 620 630 640 650 660 

110 120 130 140 150 160 

GGTAGCTCAGCTGCTCCGGATCCCACAAGCCATCTTGGACATGATCGCTGGTGCTCACTG 

AATCGCTCAGCTCCTCCGGATCCCACAAGCCA 

670 680 690 700 710 720 

170 180 190 200 210 220 

GGGAGTCCTGGCGGGCATAGCGTATTTCTCCATGGTGGGGAACTGGGCGAAGGTCTTGGC 



GGGAGTCCTGGCGGGCATAGCGTATTTCTCCATGGTGGGGAACTGGGCGAAGGTCCTGGT 
730 740 750 760 770 780 

230 240 250 260 270 280 

AGTGCTGCTGCTATTTGCCGGCGTCGACGCGGAAACCCACGTCACTGGGGGGATCGCCGC 



AGTCCTGCTCCTATTTCCCGGCG 

790 800 810 820 830 840 

290 300 310 320 330 340 

CAAAACTACGGCTAGGCTTACTGGTCTCTTCAATTTAGGTGCCAAGCAGAACATCCAGCT 

« ■ • • • * mm m mm • ■ * • **• •*••••* 

• * • • • * • • • ■ »•« • * • • * •>••*••••••• ■ ■•■*■» 

CCACACTGTGTCTGGATTTGTTAGCCTCCTCGCACCAGGCGCCAAGCAGAACGTCCAGCT 
850 860 870 880 890 900 

350 360 370 380 390 400 

GATCAACACCAACGGCAGTTGGCACATCAACAGGACGGCCTTGAACTC 

*•••••*••»*••••••••••••■• •••••• ••••••••••••••••• • • 

•••••• •••*■*••»•»•••••*•* 

GATCAACACCAACGGCAGTTGGCACCTCAATAGCACGGCCCTGAACTGCAATGATAGCCT 
910 920 930 940 950 960 

410 420 

CAACACCGGCTGGAATTC 
:X 

CAACACCGGCTGGTTGGCAGGGCTTTTCTATCACCACAAGTTCAACTCTTCAGGCTGTCC 
970 980 990 1000 1010 1020 

FIG. 86 



^SfUM*^ AA #117-308 (putative envelope region^ 



FIG. 87 



1) HCT #18 (USA) 

2) JH23 (USA) 

3) JH 27 (USA) 

4) PBL-Th (USA) 

5) EC1 (Italy) 

6) HCV-1 (chimpanzee) 

C/M*-y->S 



3 clones sequenced 
? 
? 

2 clones sequenced 

3 clones sequenced 
multiple 

(p) 



1) 
2) 

3) 
4) 
5) 

6)RNLGKVIDTLTCGFADLMGYIPLVGAPLGGAARALAHGVRVLEDGVNYATGNL 

1) H 

2) 

3) S T T 

4) L 

5) (F) s 

6) PGCSFSIFLLALLSCLTVPASAYQVRNSTGLYHVTNDCPNSSIVYEAADAILH 



1) (H) V V T 

2) A D V V K T 

3) S PVA N 

4) A A R T 

5) H V T 

6) TPGCVPCVREGNASRCWVAMTPTVATRDGKLPATQLRRHIDLLVGSATLCS 



1) 

2) I D 

3) D 
4) 

5) I 

6) ALYVGDLCGSVTLVGCOTFSPRRHWTTCX3CNCSI 
SUMMARY: "S" AA1 17-308 (93%) 

HCT018, PBL-Th, EC1 (Italy) have 97% homology with HCV-1 

JH23 and JH 27 have 96% and 95% homology with HCV-1 .respectively 



AA#300-438 ( C-terminal region of the putative envel ope region and 
amino -1/3 of NSH 



1) JH23 

2) JH27 

3) Japanese isolate (T. Miyamura) 

4) EC10 (Italy) 



5) HCV-1 (chimpanzee) 



? 
? 
? 

2 clones sequenced 
(one nt difference, which did not 
result in an amino acid change) 
multiple 
S < i > NSI 



T 

A 
A 



1) D 

2) D 

3) VS VM V 

4) 

5)TTC^CNCSIYPGHITGHRMAWDMMMNWSPTTALVMACttlBIPQAILDMIAGA 



1 ) M R ARSTA VA 

2 ) T YT N A R TQALT F 

3 ) L Y I M QH R VQ VT TLT 

4 ) A I A K TASLTA 

5) HWGVUVGIAYFSMVGNWAKVLVVLLLFAGVDAETHVTGGSAGHTVSGFVSL 

1 )PS R I | TV 

2) FT Dl I R AD 

3) FR S Kl V I R Q F 

4) FNL I I R N 

5) IJKPGAXQNVQLINrrNGSWHLNSTALNCNDSLNTGWL 



SUMMARY: NS 1 AA 330-660 

"Isolate" ZHomology (AA330-438) 

JH23 83 
JH27 80 
Japanese 73 
EC10 (Italy) 84 



^Homology CAA 38 3-405) 

57 
39 
48 
48 



FIG. 88 



FIG. 89A 

5' terminus 

CACTCCACCAT6AATCACTGCCCTGTGAGGAACTACTGTCTTCAC6CAGAAAGCGTCTAG 
CCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCCCCCCTCCCGGGAGAGCCATA 
GTGGTCTGCGGAACCGGTGAGTACACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGA 
TCAACCCGCTCAATGCCTGGAGATTTGGGCGTGCCCCCGCAAGACTGCTAGCCGAGTAGT 
GTTGGGTCGCGAAAGGCCTTGTGGTACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAG-300 

(Putative initiator methionine codon) 

G C 

GTCTCGTAGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAAAAAAAACAAACGTAA 

CACCAACCGTCGCCCACAGGACGTCAAGTTCCCGGGTGGCGGTCAGATCGTTGGTGGAGT 

TTACTTGTTGCCGCGCAGGGGCCCTAGATTGGGTGTGCGCGCGACGAGAAAGACTTCCGA 

GCGGTCGCAACCTCGAGGTAGACGTCAGCCTATCCCCAAGGCTCGTCGGCCCGAGGGCAG 

GACCTGGGCTCAGCCCGGGTACCCTTGGCCCCTCTATGGCAATGAGGGCTGCGGGTGGGC-600 

GGGATGGCTCCTGTCTCCCCGTGGCTCTCGGCCTAGCTGGGGCCCCACAGACCCCCGGCG 

TAGGTCGCGCAATTTGGGTAAGGTCATCGATACCCTTACGTGCGGCTTCGCCGACCTCAT 

GGGGTACATACCGCTCGTCGGCGCCCCTCTTGGAGGCGCTGCCAGGGCCCTGGCGCATGG 

CGTCCGGGTTCTGGAAGACGGCGTGAACTATGCAACAGGGAAGCTTCCTGGTTGCTCTTT 

C 

CTCTATCTTCCTTCTGGCCCTGCTCTCTTGCTTGACTGTGCCCGCTTCGGCCTACCAAGT-900 
GCGCAACTCCACGGGGCTTTACCACGTCACCAATGATTGCCCTAACTCGAGTATTGTGTA 
CGAGGCGGCCGATGCCATCCTGCACACTCCGGGGTGCGTCCCTTGCGTTCGTGAGGGCAA 



CGCCTCGAGGTGTTGGGTGGCGATGACCCCTACGG 



CGCGACGCAGCTTCGACGTCACATCGATCTGCTTGTCGGGAGCGCCACCCTCTGTTCGGC 



CCTCTACGTGGGGGACCTATGCGGGTCTGTCTT 
TCCCAGGCGCCACTGGACGACGCAAGGTTGCAA" 



GGCCACCAGGGATGGCAAACTCCC 



TGTCGGCCAACTGTTCACCTTCTC-1200 
TGCTCTATCTATCCCGGCCATATAAC 



G 

GGGTCACCGCATGGCATGGGATATGATGATGAACTGGTCCCCTACGACGGCGTTGGTAAT 

GGCTCAGCTGCTCCGGATCCCACAAGCCATCTTGGACATGATCGCTGGTGCTCACTGGGG 

AGTCCTGGCGGGCATAGCGTATTTCTCCATGGTGGGGAACTGGGCGAAGGTCCTGGTAGT 

GCTGCTGCTATTTGCCGGCGTCGACGCGGAAACCCACGTCACCGGGGGAAGTGCCGGCCA-1500 

CACTGTGTCTGGATTTGTTAGCCTCCTCGCACCAGGCGCCAAGCAGAACGTCCAGCTGAT 

CAACACCAACGGCAGTTGGCACCTCAATAGCACGGCCCTGAACTGCAATGATAGCCTCAA 

CACCGGCTGGTTGGCAGGGCTTTTCTATCACCACAAGTTCAACTCTTCAGGCTGTCCTGA 

GA6GCTAGCCAGCTGCCGACCCCTTACCGATTTTGACCAGGGCTGGGGCCCTATCAGTTA 

TGCCAACGGAAGCGGCCCCGACCAGCGCCCCTACTGCTGGCACTACCCCCCAAAACCTTG-1800 

CGGTATTGTGCCCGCGAAGAGTGTGTGTGGTCCGGTATATTGCTTCACTCCCAGCCCCGT 

GGTGGTGGGAACGACCGACAGGTCGGGCGCGCCCACCTACAGCTGGGGTGAAAATGATAC 

GGACGTCTTCGTCCTTAACAATACCAGGCCACCGCTGGGCAATTGGTTCGGTTGTACCTG 

GATGAACTCAACT6GATTCACCAAAGTGTGCGGAGCGCCTCCTTGT6TCATCGGAGGGGC 

GGGCAACAACACCCTGCACTGCCCCACTGATTGCTTCCGCAAGCATCCGGACGCCACATA-2100 

C 

CTCTCGGTGCGGCTCCGGTCCCTGGATCACACCCAGGTGCCTGGTCGACTACCCGTATAG 

GCTTTGGCATTATCCTTGTACCATCAACTACACCATATTTAAAATCAGGATGTACGTGGG 

AGGGGTCGAACACAGGCTGGAAGCTGCCTGCAACTGGACGCGGGGCGAACGTTGCGATCT 

GGAAGACAGGGACAGGTCCGAGCTCAGCCCGTTACTGCTGACCACTACACAGTGGCAGGT 

CCTCCCGTGTTCCTTCACAACCCTACCAGCCTTGTCCACCGGCCTCATCCACCTCCACCA-2^00 

GAACATTGTGGACGTGCAGTACTTGTACGGGGTGGGGTCAAGCATCGCGTCCTGGGCCAT 

TAAGTGGGAGTACGTCGTTCTCCTGTTCCTTCTGCTTGCAGACGCGCGCGTCTGCTCCTG 

CTTGTGGATGATGCTACTCATATCCCAAGCGGAGGCGGCTTTGGAGAACCTCGTAATACT 

TAATGCAGCATCCCTGGCCGGGACGCACGGTCTTGTATCCTTCCTCGTGTTCTTCTGCTT 

TGCATGGTATTTGAA6GGTAA6TGGGTGCCCGGAGCGGTCTACACCTTCTACGGGATGTG-2700 

GCCTCTCCTCCTGCTCCTGTTGGCGTTGCCCCAGCGGGCGTACGCGCTGGACACGGAGGT 

GGCCGCGTCGTGTGGCGGTGTTGTTCTCGTCGGGTTGATGGCGCTGACTCTGTCACCATA 

TTACAAGCGCTATATCAGCTGGTGCTTGTGGTGGCTTCAGTATTTTCTGACCAGAGTGGA 

AGCGCAACTGCACGTGTGGATTCCCCCCCTCAACGTCCGAGGGGGGCGCGACGCCGTCAT 
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QIIACTCAT6T6TGCT6TACACCCGACTCTGGTATTT6ACATCACCAAATTGCTGCTGGC-3000 

CGTCTTCGGACCCCTTTGGATTCTTCAAGCCAGTTTGCTTAAAGTACCCTACTTTGTGCG 

CGTCCAAGGCCTTCTCCGGTTCTGCGCGTTAGCGCGGAAGATGATCGGAGGCCATTACGT 

GCAAATGGTCATCATTAAGTTAGGGGCGCTTACTGGCACCTATGTTTATAACCATCTCAC 

TCCTCTTCGGGACTGGGCGCACAACGGCTTGCGAGATCTGGCCGTGGCTGTAGAGCCAGT 

CGTCTTCTCCCAAATGGAGACCAAGCTCATCACGTGGGGGGCAGATACCGCCGCGTGCGG-3300 

TGACATCATCAACGGCTTGCCTGTTTCCGCCCGCAGGGGCCGGGAGATACTGCTCGGGCC 

AGCCGATGGAATGGTaCCAAG^ 

GCAGACAAGGGGCCTCCTAGGGTGCATAATCACCAGCCTAACTGGCCGGGACAAAAACCA 
AGTGGAGGGTGAGGTCCAGATTGTGTCAACTGCTGCCCAAACCTTCCTGGCAACGTGCAT 
CAATGGGGTGTGCTGGACTGTCTACCACGGGGCCGGAACGAGGACCATCGCGTCACCCAA-3600 

GGGTCCTGTCATCCAGATGTATACCAATGTAGACCAAGACCTTGTGGGCTGGCCCGCTCC 
C 

GCAAGGTAGCCGCTCATTGACACCCTGCACTTGCGGCTCCTCGGACCTTTACCTGGTCAC 

GAGGCACGCCGATGTCATTCCCGTGCGCCGGCGGGGTGATAGCAGGGGCAGCCTGCTGTC 

GCCCCGGCCCATTTCCTACTTGAAAGGCTCCTCGGGGGGTCCGCTGTTGTGCCCCGCGGG 

GCACGCCGTGGGCATATTTAGGGCCGC6GTGTGCACCCGTGGAGTGGCTAAGGCGGTGGA-3900 

CTTI A I CCCTGT6 GAGAACCTAGAGACAACCATGAGGTCCCCGGTGTTCACGGATAACTC 

CTCTCCACCAGTAGTGCCCCAGAGCTTCCAGGTGGCTCACCTCCATGCTCCCACAGGCAG 

CGGCAAAAGCACCAAGGTCCCGGCTGCATATGCAGCTCAGGGCTATAAGGTGCTAGTACT 

CAACCCCTCTGTTGCTGCAACACTGGGCTTTGGTGCTTACATGTCCAAGGCTCATGGGAT 

T 

CGATCCTAACATCAGGACCGGGGTGAGAACAATTACCACTGGCAGCCCCATCACGTACTC-4200 

CACCTACGGCAAGTTCCTTGCCGACGGCGGGTGCTCGGGGGGCGCTTATGACATAATAAT 

TTGTGACGAGTGCCACTCCACGGATGCCACATCCATCTTGGGCATCGGCACTGTCCTTGA 

CCAAGCAGAGACTGCGGGGGCGAGACTGGTTGTGCTCGCCACCGCCACCCCTCCGGGCTC 

CGTCACTGTGCCCCATCCCAACATCGAGGAGGTTGCTCTGTCCACCACCGGAGAGATCCC 

TTTTTACGGCAAGGCTATCCCCCTCGAAGTAATCAAGGGGGGGAGACATCTCATCTTCTG-4500 

TCATTCAAAGAAGAAGTGCGACGAACTCGCCGCAAAGCTGGTCGCATTGGGCATCAATGC 

CGTGGCCTACTACCGCGGTCTTGACGTGTCCGTCATCCCGACCAGCGGCGATGTTGTCGT 

/\ 

CGTGGCAACCGATGCCCTCATGACCGGCTATACCGGCGACTTCGACTCGGTGATAGACTG 

CAATACGTGTGTCACCCAGACAGTCGATTTCAGCCTTGACCCTACCTTCACCATTGAGAC 

AATCACGCTCCCCCAGGATGCTGTCTCCCGCACTCAACGTCGGGGCAGGACTGGCAGGGG-4800 

GAAGCCAGGCATCTACAGATTTGTGGCACCGGGGGAGCGCCCCTCCGGCATGTTCGACTC 

GTCCGTCCTCTGTGAGTGCTATGACGCAGGCTGTGCTTGGTATGAGCTCACGCCCGCCGA 

GACTACAGTTAGGCTACGAGCGTACATGAACACCCCGGGGCTTCCCGTGTGCCAGGACCA 

TCTTGAATTTTGGGAGGGCGTCTTTACAGGCCTCACTCATATAGATGCCCACTTTCTATC 

CCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACCAAGCCACCGTGTG-5100 

CGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGTGTTTGATTCGCCT 

CAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTGGGCGCTGTTCAGAATGA 

AATCACCCTGACGCACCCAGTCACCAAATACATCATGACATGCATGTCGGCCGACCTGGA 

GGTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTTTGGCCGCGTATTG 

CCTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTGTCCGGGAAGCCGGCAAT-5100 

CATACCTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGATGGAAGAGTGCTCTCAGCA 

CTTACCGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAAGCAGAAGGCCCTCGG 

CCTCCTGCAGACCGCGTCGCGTCAGGCAGAGGTTATCGCCCCTGCTGTCCAGACCAACTG 

GCAAAAACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCAGTGGGATACAATA 

CTTGGCGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGCTTCATTGATGGCTTTTAC-5700 

AGCTGCTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCTTCAACATATTGGGGGG 

GTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGTGGGCGCTGGCTT 

AGCTGGCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAGACATCCTTGCAGG 

GTATG6CGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAGATCATGAGCGGTGAGGTCCC 

CTCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCC6GAGCCCTCGTAGT-6000 

CGGCGTGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCGAGGGGGCAGTGCA 

GTGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCCCACGCACTA 

CGTGCCGGAGAGCGATGCAGCTGCCCGCGTCACTGCCATACTCAGCAGCCTCACTGTAAC 

CCAGCTCCTGAGGCGACTGCACCAGTGGATAAGCTCGGAGTGTACCACTCCATGCTCCGG 
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TTCCTG6CTAA6GGACATCTGGGACTGGATATGCGAGGT6TT6AGCGACTTTAAGACCTG-6300 
GCTAAAAGCTAAGCTCATGCCACAGCTGCCTGGGATCCCCTTTGTGTCCTGCCAGCGCGG 
GTATAAGGGGGTCTGGCGAGTGGACGGCATCATGCACACTCGCTGCCACTGTGGAGCTGA 
GATCACTGGACATGTCAAAAACGGGACGATGAGGATCGTCGGTCCTAGGACCTGCAGGAA 
CATGT6GAGTGGGACCTTCCCCATTAATGCCTACACCACGGGCCCCTGTACCCCCCTTCC 
TGCGCCGAACTACACGTTCGGGCTATGGAGGGTGTCTGCAGAGGAATATGTGGAGATAAG-6600 
GCAGGTGGGGGACTTCCACTACGTGACGGGTATGACTACTGACAATCTCAAATGCCCGTG 
CCAGGTCCCATCGCCCGAATTTTTCACAGAATTGGACGGGGTGCGCCTACATAGGTTTGC 
GCCCCCCTGCAAGCCCTTGCTGCGGGAGGAGGTATCATTCAGAGTAGGACTCCACGAATA 
CCCGGTAGGGTCGCAATTACCTTGCGAGCCCGAACCGGACGTGGCCGTGTTGACGTCCAT 
GCTCACTGATCCCTCCCATATAACAGCAGAGGCGGCCGGGCGAAGGTTGGCGAGGGGATC-6900 
ACCCCCCTCTGTGGCCAGCTCCTCGGCTAGCCAGCTATCCGCTCCATCTCTCAAGGCAAC 
TTGCACCGCTAACCATGACTCCCCTGATGCTGAGCTCATAGAGGCCAACCTCCTATGGAG 
GCAGGAGATGGGCGGCAACATCACCAGGGTTGAGTCAGAAAACAAAGTGGTGATTCTGGA 
CTCCTTCGATCCGCTTGTGGCGGAGGAGGACGAGCGGGAGATCTCCGTACCCGCAGAAAT 
CCTGCGGAAGTCTCGGAGATTCGCCCAGGCCCTGCCCGTTTGGGCGCGGCCGGAeTATAA-7200 
CCCCCCGCTAGTGGAGACGTGGAAAAAGCCCGACTACGAACCACCTGTGGTCCATGGCTG 
TCCGCTTCCACCTCCAAAGTCCCCTCCTGTGCCTCCGCCTCGGAAGAAGCGGACGGTGGT 
CCTCACTGAATCAACCCTATCTACTGCCTTGGCCGAGCTCGCCACCAGAAGCTTTGGCAG 
CTCCTCAACTTCCGGCATTACGGGCGACAATACGACAACATCCTCTGAGCCCGCCCCTTC 
TGGCTGCCCCCCCGACTCCGACGCTGAGTCCTATTCCTCCATGCCCCCCCTGGAGGGGGA-7500 
GCCTGGGGATCCGGATCTTAGCGACGGGTCATGGTCAACGGTCAGTAGTGAGGCCAACGC 
GGAGGATGTCGTGTGCTGCTCAATGTCTTACTCTTGGACAGGCGCACTCGTCACCCCGTG 
CGCCGCGGAAGAACAGAAACTGCCCATCAATGCACTAAGCAACTCGTTGCTACGTCACCA 
CAATTTGGTGTATTCCACCACCTCACGCAGTGCTTGCCAAAGGCAGAAGAAAGTCACATT 
TGACAGACTGCAAGTTCTGGACAGCCATTACCAGGACGTACTCAAGGAGGTTAAAGCAGC-7800 
GGCGTCAAAAGTGAAGGCTAACTTGCTATCCGTAGAGGAAGCTTGCAGCCTGACGCCCCC 
ACACTCAGCCAAATCCAAGTTTGGTTATGGGGCAAAAGACGTCCGTTGCCATGCCAGAAA 
GGCCGTAACCCACATCAACTCCGTGTGGAAAGACCTTCTGGAAGACAATGTAACACCAAT 
AGACACTACCATCATGGCTAAGAACGAGGTTTTCTGCGTTCAGCCTGAGAAGGGGGGTCG 
TAAGCCAGCTCGTCTCATCGTGTTCCCCGATCTGGGCGTGCGCGTGTGCGAAAAGATGGC-8100 
TTTGTACGACGTGGTTACAAAGCTCCCCTTGGCCGTGATGGGAAGCTCCTACGGATTCCA 
ATACTCACCAGGACAGCGGGTTGAATTCCTCGTGCAAGCGTGGAAGTCCAAGAAAACCCC 
AATGGGGTTCTCGTATGATACCCGCTGCTTTGACTCCACAGTCACTGAGAGCGACATCCG 
TACGGAGGAGGCAATCTACCAATGTTGTGACCTCGACCCCCAAGCCCGCGTGGCCATCAA 
GTCCCTCACCGAGAGGCTTTATGTTGGGGGCCCTCTTACCAATTCAAGGGGGGAGAACTG-8100 
CGGCTATCGCAGGTGCCGCGCGAGCGGCGTACTGACAACTAGCTGTGGTAACACCCTCAC 
TTGCTACATCAAGGCCCGGGCAGCCTGTCGAGCCGCAGGGCTCCAGGACTGCACCATGCT 



CGTGTGTGGCGACGACTTAGTCGTTA 
GAGCCTGAGAGCCTTCACGGAGGCTA 
ACAACCAGAATACGACTTGGAGCTCA 



rCTGTGAAAGCGCGGGGGTCCAGGAGGACGCGGC 
GACCAGGTACTCCGCCCCCCCTGGGGACCCCCC 

rAACATCATGCTCCTCCAACGTGTCAGTCGCCCA-8700 

CGACGGCGCTGGAAAGAGGGTCTACTACCTCACCCGTGACCCTACAACCCCCCTCGCGAG 
AGCTGCGTGGGAGACAGCAAGACACACTCCAGTCAATTCCTGGCTAGGCAACATAATCAT 
GTTTGCCCCCACACTGTGGGCGAGGATGATACTGATGACCCATTTCTTTAGCGTCCTTAT 
AGCCAGGGACCAGCTTGAACAGGCCCTCGATTGCGAGATCTACGGGGCCTGCTACTCCAT 
AGAACCACTGGATCTACCTCCAATCATTCAAAGACTCCATGGCCTCAGCGCATTTTCACT-9000 
CCACAGTTACTCTCCAGGTGAAATTAATAGGGTGGCCGCATGCCTCAGAAAACTTGGGGT 

G 

ACCGCCCTTGCGAGCTTGGAGACACCGGGCCCGGAGCGTCCGCGCTAGGCTTCTGGCCAG 

AGGAGGCAGGGCTGCCATATGTGGCAAGTACCTCTTCAACTGGGCAGTAAGAACAAAGCT 

CAAACTCACTCCAATAGCG6CCGCTGGCCAGCTGGACTTGTCCGGCTGGTTCACGGCTGG 

CTACAGCGGGGGAGACATTTATCACAGCGTGTCTCATGCCCGGCCCCGCTGGATCTGGTT-9300 

TTGCCTACTCCTGCTTGCTGCAGGGGTAGGCATCTACCTCCTCCCCAACCGATGAAGGTT 

GGGGTAAACACTCC6GCCT 3' terminus 

Some clonal heterogeneities producing amino acid 
substitutions are shown. There are many other 
"silent mutations (not shown). 
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R T 

MSTNPKPQKKNKRNTNRRPQDVKFP6GGQIVGGVYLLPRR6PRLGVRATR 

KTSERSQPRGRRQPIPKARRPEGRTWAQPGYPWPLYGNEGCGWAGWLLSP-100 

RGSRPSWGPTDPRRRSRNLGKVIDTLTCGFADLMGYIPLVGAPLGGAARA 

LAHGVRVLEDGVNYATGNLPGCSFSIFLLALLSCLTVPASAYQVRNSTGL-200 

YHVTNDCPNSSIVYEAADAILHTPGCVPGVREGNASRCWVAMTPTVATRD 

GKLPATQLRRHIDLLVGSATLCSALYVGDLCGSVFLVGQLFTFSPRRHWT-300 

V 

TQGCNCS IYPGHI TGHRMAWDMMMNWSPTTA LVMAQLLR I PQA I LDM I AG 
AHWGVLAGIAYFSMVGNWAKVLVVLLLFAGVDAETHVTGGSAGHTVSGFV-400 
SLLAPGAKQNVQL I NTNGSWHLNSTALNCNDSLNTGWLAGLFYHHKFNSS 
GCPERLASCRPLTDFDQGWGP I SYANGSGPDQRPYCWHYPPKPCG I VPAK-500 
SVCGPVYCFTPSPVVVGTTDRSGAPTYSWGENDTDVFVLNNTRPPLGNWF 
GCTWMNSTGFTKVCGAPPCVIGGAGNNTLHCPTDCFRKHPDATYSRCGSG-600 

I 

PWLTPRCLVDYPYRLWHYPCT I NYT I FKI RMYVGGVEHRLEAACNWTRGE 
RCDLEDRDRSELSPLLLTTTQWQVLPCSFTTLPALSTGLIHLHQNIVDVQ-700 
YLYGVGSSIASWAIKWEYVVLLFLLLADARVCSCLWMMLLISQAEAALEN 
LVILNAASLAGTHGLVSFLVFFCFAWYLKGKWVPGAVYTFYGMWPLLLLL-800 

(N) 

LALPQRAYALDTEVAASCGGVVLVGLMALTLSPYYKRYISWCLWWLQYFL 
TRVEAQLHVW I PPLNVRGG RDAV I LLMCA VHPTL VFD I TKLLL AVFGPLW-900 
ILQASLLKVPYFVRVQGLLRFCALARKMIGGHYVQMVIIKLGALTGTYVY 
NHLTPLRDWAHNGLRDLAVAVEPVVFSQMETKLITWGADTAACGDIINGL-1000 
PVSARRGREILLGPADGMVSKGWRLLAPITAYAQQTRGLLGCI ITSLTGR 
DKNQVEGEVQ I VSTAA QTF L ATC I NGVCWTVYHGAGTRTI ASPKGPVI QM-1 100 

S T 

YTNVDQDLVGWPAPQGSRSLTPCTCGSSDLYLVTRHADVIPVRRRGDSRG 

SLLSPRPISYLKGSSGGPLLCPAGHAVGIFRAAVCTRGVAKAVDFIPVEN-1200 

LETTMRSPVFTDNSSPPWPQSFQVAHLHAPTGSGKSTKVPAAYAAQGYK 

L 

VLVLNPSVAATLGFGA YMSKAHG IDPNI RTGVRT I TTGSP I TYSTYGKFL- 1300 

ADGGCSGGAYDI I ICDECHSTDATSI LGIGTVLDQAETAGARLVVLATAT 

PPGSVTVPHPNIEEVALSTTGEIPFYGKAIPLEVIKGGRHLIFCHSKKKC-iaOO 

DELAAKLVALGINAVAYYRGLDVSVIPTSGDVVVVATDALMTGYTGDFDS 

Y (S) 
VIDCNTCVTQTVDFSLDPTFTIETITLPQDAVSRTQRRGRTGRGKPGIYR-1500 
FVAPGERPSGMFDSSVLCECYDAGCAWYELTPAETTVRLRAYMNTPGLPV 
CQDHLEFWEGVFTGLTHIDAHFLSQTKQSGENLPYLVAYQATVCARAQAP-1600 
PPSWDQMWKCLIRLKPTLHGPTPLLYRLGAVQNEITLTHPVTKYIMTCMS 
ADLEVVTSTWVLVGGVLAALAAYCLSTGCVVIVGRVVLSGKPAIIPDREV-1700 
LYREFDEMEECSQHLPYIEQGMMLAEQFKQKALGLLQTASRQAEVIAPAV 
QTNWQKLETFWAKHMWNF I SG I QYLAGLSTLPGNPA I ASLMAFTAAVTSP-1800 
LTTSQTLLFN I LGGWVA AQLA APG AATAFVGAGL AGAA I GSVGLGKVL I D 
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(6) 

1 LAGYGAGVAGALVAFKI MSGEVPSTEDLVNLLPA I LSPGALVVGVVCAA- 1900 

(HO 

I LRRHVGPGEGAVQWMNRL I AFASRGNHVSPTHYVPESD AAARVTA I LSS 
LTVTQLLRRLHQWISSECTTPCSGSWLRDIWDWICEVLSDFKTWLKAKLM-2000 

(V) 

PQLPGIPFVSCQRGYKGVWRGDGIMHTRCHCGAEITGHVKNGTMRIVGPR 

TCRNMWSGTFPI NAYTTGPCTPLPAPNYTFALWRVSAEEYVEI RQVGDFH-2100 

YVTGMTTDNLKCPCQVPSPEFFTELDGVRLHRFAPPCKPLLREEVSFRVG 

LHEYPVGSQLPCEPEPDVAVLTSMLTDPSHITAEAAGRRLARGSPPSVAS-2200 

SSASQLSAPSLKATCTANHDSPDAELIEANLLWRQEMGGNITRVESENKV 

VILDSFDPLVAEEDEREISVPAEILRKSRRFAQALPVWARPDYNPPLVET-2300 

(S) 

WKKPDYEPPVVHGCPLPPPKSPPVPPPRKKRTVVLTESTLSTALAELATR 

(FA) 

SFGSSSTSGITGDNTTTSSEPAPSGCPPDSDAESYSSMPPLEGEPGDPDL-2400 

SDGSWSTVSSEANAEDVVCCSMSYSWTGALVTPCAAEEQKLPINALSNSL 

LRHHNLVYSTTSRSACQRQKKVTFDRLQVLDSHYQDVLKEVKAAASKVKA-2500 

(F) 

NLLSVEEACSLTPPHSAKSKFGYGAKDVRCHARKAVTHINSVWKDLLEDN 

VTPIDTTIMAKNEVFCVQPEKGGRKPARLIVFPDLGVRVCEKMALYDVVT-2600 

KLPLAVMGSSYGFQYSPGQRVEFLVQAWKSKKTPMGFSYDTRCFDSTVTE 

(G) 

SDIRTEEAIYQCCDLDPQARVAIKSLTERLYVGGPLTNSRGENCGYRRCR-2700 
ASGVLTTSCGNTLTCYIKARAACRAAGLQDCTMLVCGDDLVVICESAGVQ 
EDAASLRAFTEAMTRYSAPPGDPPQPEYDLELITSCSSNVSVAHDGAGKR-2800 
VYYLTRDPTTPLARAAWETARHTFVNSWLGNIIMFAPTLWARMILMTHFF 
SVLIARDQLEQALDCEIYGACYSIEPLDLPPI IQRLHGLSAFSLHSYSPG-2900 

G 

EINRVAACLRKLGVPPLRAWRHRARSVRARLLARGGRAAICGKYLFNWAV 

(P) 

RTKLKLTPIAAAGQLDLSGWFTAGYSGGDIYHSVSHARPRWIWFCLLLLA-3000 
AGVGIYLLPNRO-3011 

Stop codon 



( ) = Heterogeneity due possibly 
to 5' or 3' terminal cloning 
artefact. 
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